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concerned with the design and analysis 
of explicit procedures for calculating 
the exact or approximate values of 
various functions. Such explicit pro- 
cedures are called algorithms or pro- 
grams. Because an effective notation 
for the description of programs exhibits 
considerable syntactic structure, it is 
called a programming language. 


“Much of applied mathematics, 
particularly the more recent computer- 
related areas which cut across the 
older disciplines, suffers from the lack 
of an adequate programming language. 
It is the central thesis of this book 
that the descriptive and analytic power 
of an adequate programming language 
amply repays the considerable effort 
required for its mastery. This thesis 
is developed by first presenting the 
entire language and then applying it 
in later chapters to several major 
topics. 


“The areas of application are chosen 
primarily for their intrinsic interest 
and lack of previous treatment, but 
they are also designed to illustrate the 
universality and other facets of the 
language. For example, the micropro- 
gramming of Chapter 2 illustrates the 
divisibility of the language, i.e., the 
ability to treat a restricted area using 
only a small portion of the complete 
language. Chapter 6 (Sorting) shows 
its capacity to compass a_ relatively 
complex and detailed topic in a short 
space. Chapter 7 (The Logical Cal- 
culus) emphasizes the formal manipu- 
lability of the language and its utility 
in theoretical work. 
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To My Many Teachers 


PREFACE 


Applied mathematics is largely concerned with the design and analysis of 
explicit procedures for calculating the exact or approximate values of 
various functions. Such explicit procedures are called algorithms or 
programs, Because an effective notation for the description of programs 
exhibits considerable syntactic structure, it is called a programming 
language. 

Much of applied mathematics, particularly the more recent computer- 
related areas which cut across the older disciplines, suffers from the lack 
of an adequate programming language. It is the central thesis of this book 
that the descriptive and analytic power of an adequate programming 
language amply repays the considerable effort required for its mastery. 
This thesis is developed by first presenting the entire language and then 
applying it in later chapters to several major topics. 

The areas of application are chosen primarily for their intrinsic interest 
and lack of previous treatment, but they are also designed to illustrate the 
universality and other facets of the language. For example, the micro- 
programming of Chapter 2 illustrates the divisibility of the language, i.e., 
the ability to treat a restricted area using only a small portion of the 
complete language. Chapter 6 (Sorting) shows its capacity to compass a 
relatively complex and detailed topic in a short space. Chapter 7 (The 
Logical Calculus) emphasizes the formal manipulability of the language 
and its utility in theoretical work. 

The material was developed largely in a graduate course given for 
several years at Harvard and ina later course presented repeatedly at the 
IBM Systems Research Institute in New York. It should prove suitable 
for a two-semester course at the senior or graduate level. Although for 
certain audiences an initial presentation of the entire language may be 
appropriate, I have found it helpful to motivate the development by 
presenting the minimum notation required for a given topic, proceeding 
to its treatment (e.g., microprogramming), and then returning to further 
notation. The 130-odd problems not only provide the necessary finger 
exercises but also develop results of general interest. 
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Chapter | or some part of it is prerequisite to each of the remaining 
“applications” chapters, but the applications chapters are virtually 
independent of one another. A complete appreciation of search techniques 
(Chapter 4) does, however, require a knowledge of methods of representa- 
tion (Chapter 3). The cross references which do occur in the applications 
chapters are either nonessential or are specific to a given figure, table, or 
program. The entire language presented in Chapter | is summarized for 
reference at the end of the book. 

In any work spanning several years it is impossible to acknowledge 
adequately the many contributions made by others. Two major acknowl- 
edgments are in order: the first to Professor Howard Aiken, Director 
Emeritus of the Harvard Computation Laboratory, and the second to 
Dr. F. P. Brooks, Jr. now of IBM. 

It was Professor Aiken who first guided me into this work and who 
provided support and encouragement in the early years when it mattered. 
The unusually large contribution by Dr. Brooks arose as follows. Several 
chapters of the present work were originally prepared for inclusion in a 
joint work which eventually passed the bounds of a single book and 
evolved into our joint Automatic Data Processing and the present volume. 
Before the split, several drafts of these chapters had received careful 
review at the hands of Dr. Brooks, reviews which contributed many 
valuable ideas on organization, presentation, and direction of investiga- 
tion, as well as numerous specific suggestions. 

The contributions of the 200-odd students who suffered through the 
development of the material must perforce be acknowledged collectively, 
as must the contributions of many of my colleagues at the Harvard 
Computation Laboratory. To Professor G. A. Salton and Dr. W. L. 
Eastman, I am indebted for careful reading of drafts of various sections 
and for comments arising from their use of some of the material in courses. 
Dr. Eastman, in particular, exorcised many subtle errors from the sorting 
programs of Chapter 6. To Professor A. G. Oettinger and his students 
I am indebted for many helpful discussions arising out of his early use of 
the notation. My debt to Professor R. L. Ashenhurst, now of the Univer- 
sity of Chicago, is apparent from the references to his early (and un- 
fortunately unpublished) work in sorting. 

Of my colleagues at the IBM Research Center, Messrs. L. R. Johnson 
and A. D. Falkoff, and Dr. H. Hellerman have, through their own use of 
the notation, contributed many helpful suggestions. I am particularly 
indebted to L. R. Johnson for many fruitful discussions on the applications 
of trees, and for his unfailing support. 

On the technical side, I have enjoyed the assistance of unusually compe- 
tent typists and draughtsmen, chief among them being Mrs. Arthur 
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Aulenback, Mrs. Philip J. Seaward, Jr., Mrs. Paul Bushek, Miss J. L. 
Hegeman, and Messrs. William Minty and Robert Burns. Miss Jacquelin 
Sanborn provided much early and continuing guidance in matters of style, 
format, and typography. I am indebted to my wife for assistance in 
preparing the final draft. 
KENNETH E. IVERSON 
May, 1962 
Mount Kisco, New York 
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THE LANGUAGE 


1.1) INFRODUCTION 


Applied mathematics is concerned with the design and analysis of 
algorithms or programs. The systematic treatment of complex algorithms 
requires a suitable programming language for their description, and such a 
programming language should be concise, precisc, consistent over a wide 
area of application, mnemonic, and economical of symbols: it should 
exhibit clearly the constraints on the sequence in which operations are 
performed; and it should permit the description of a process to be inde- 
pendent of the particular representation chosen for the data. 

Existing languages prove unsuitable for a variety of reasons. Computer 
coding specifies sequence constraints adequately and is also comprehensive, 
since the logical functions provided by the branch instructions can, in 
principle, be employed to synthesize any finite algorithm. However, the 
set of basic operations provided is not, in general, directly suited to the 
execution of commonly needed processes, and the numeric symbols used 
for variables have little mnemonic value. Moreover, the description 
provided by computer coding depends directly on the particular represen- 
tation chosen for the data, and it therefore cannot serve as a description of 
the algorithm per se. 

Ordinary English lacks both precision and conciseness. The widely used 
Goldstine-von Neumann (1947) flowcharting provides the conciseness 
necessary to an over-all view of the process, only at the cost of suppressing 
essential detail. The so-called pseudo-English used as a basis for certian 
automatic programming systems suffers from the same defect. Moreover, 
the potential mnemonic advantage In substituting familiar English words 
and phrases for less familiar but more compact mathematical symbols fails 
to materialize because of the obvious but unwonted precision required in 
their use. 

Most of the concepts and operations needed in a programming language 
have already been defined and developed in one or another branch of 
mathematics. Therefore, much use can and will be made of existing 
notations. However, since most notations are specialized to a narrow 
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field of discourse, a consistent unification must be provided. For example, 
separate and conflicting notations have been developed for the treatment 
of sets, logical variables, vectors, matrices, and trees, all of which may, in 
the broad universe of discourse of data processing, occur in a single 
algorithm, 

1.2) PROGRAMS 


A program statement is the specification of some quantity or quantities 
in terms of some finite operation upon specified operands. Specification is 
symbolized by an arrow directed toward the specified quantity. Thus ‘“‘y 
is specified by sin x” is a statement denoted by 


y<- Sin x. 


A set of statements together with a specified order of execution consti- 
tutes a program. The program is finite if the number of executions is 
finite. The resu/ts of the program are some subset of the quantities 
specified by the program. The sequence or order of execution will be 
defined by the order of listing and otherwise by arrows connecting any 
statement to its successor. A cyclic sequence of statements is called a /oop. 


v<2x x 3.1416 


v-UKa 


Program 1.1 Finite Program 1.2 Infinite 
program program 


Thus Program 1.1 is a program of two statements defining the result ras 
the (approximate) area of a circle of radius x, whereas Program 1.2 is an 
infinite program in which the quantity = is specified as (2y)” on the nth 
execution of the two-statement loop. Statements will be numbered on the 
left for reference. 

A number of similar programs may be subsumed under a single more 
general program as follows. At certain branch points in the program a 
finite number of alternative statements are specified as possible successors. 
One of these successors is chosen according to criteria determined in the 
statement or statements preceding the branch point. These criteria are 
usually stated as a comparison or test of a specified relation between a 
specified pair of quantities. A branch is denoted by a set of arrows leading 
to each of the alternative successors, with each arrow labeled by the 
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comparison condition under which the corresponding successor is chosen. 
The quantities compared are separated by a colon in the statement at the 
branch point, and a labeled branch is followed if and only if the relation 
indicated by the label holds when substituted for the colon. The conditions 
on the branches of a properly defined program must be disjoint and 
exhaustive. 

Program 1.3 illustrates the use of a branch point. Statement «5 is a 
comparison which determines the branch to statements #1, O1, or yl, 
according as z >> 1,2 =n, orz <n. The program represents a crude but 
effective process for determining x = n° for any positive cube n. 


yO 
k<1 


wmy xy 
zy xe 


k<-k x2 


yy tk 


ol 


Program 1.3 Program for .« = n°3 


Program 1.4 shows the preceding program reorganized into a compact 
linear array and introduces two further conventions on the labeling of 
branch points. The listed successor of a branch statement Is selected if 
none of the labeled conditions is met. Thus statement 6 follows statement 
5 if neither of the arrows (to exit or to statement 8) are followed, i.e., if 
z <n. Moreover, any unlabeled arrow is always followed; e.g., statement 
7 is invariably followed by statement 3, never by statement 8. 

A program begins at a point indicated by an entry arrow (step 1) and ends 
at a point indicated by an exit arrow (step 5). There are two useful 
consequences of confining a program to the form of a linear array: the 
statements may be referred to by a unique serial index (statement number), 
and unnecessarily complex organization of the program manifests itself in 
crossing branch lines. The importance of the latter characteristic in 
developing clear and comprehensible programs is not sufficiently appre- 
ciated. 
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1— 4<0 
k<I1 
ary xy 
ey Xe 
zZt4n 
k<«k x2 
ycytk 
k<«-k +2 


Oo Oa DD nA FP WD N 


y~yn-k 


Program 1.4 Linear arrangement of Program 1.3 


1 —) i< A) 

27>} i ®) 

3 = k <»(A) 

4 Cc; <0 

5 Ci «Cj + A,’ x BF 
6 eka 

7/[|-4 k:0 

8 jes 

g9{-] 7:0 

10 p71 


— 
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Program 1.5 Matrix multiplication 


A process which is repeated a number of times is said to be iterated, and 
a process (such as Program 1.4) which includes one or more iterated 
subprocesses is said to be iterative. Program 1.5 shows an iterative process 


for the matrix multiplication 
C~-——AB 
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defined in the usual way as 
y(A) (i 


Ci = ¥ A,’ x BY, 
r= 1 


ik 


lj 


where the dimension of an m x » rectangular matrix X (of m rows and n 
columns) is denoted by w(X) x »(X). 


Program 1.5. Steps 1-3 initialize the indices, and the loop 5-7 continues to 
add successive products to the partial sum until & reaches zero. When this 
occurs, the process continues through step 8 to decrement / and to repeat the 
entire summation for the new value of /, providing that it is not zero. If/ is zero, 
the branch to step 10 decrements / and the entire process over j and & is repeated 
from / = »(B), providing that / is not zero. If / is zero, the process is complete, 
as indicated by the exit arrow. 


In all examples used in this chapter, emphasis will be placed on clarity 
of description of the process, and considerations of efficient execution by a 
computer or class of computers will be subordinated. These considerations 
can often be introduced later by relatively routine modifications of the 
program. For example, since the execution of a computer operation 
involving an indexed variable is normally more costly than the corre- 
sponding operation upon a nonindexed variable, the substitution of a 
variable s for the variable C,’ specified by statement 5 of Program 1.5 
would accelerate the execution of the loop. The variable » would be 
initialized to zero before each entry to the loop and would be used to 
specify C,’ at each termination. 

The practice of first setting an index to tts maximum value and then 
decrementing it (e.g., the index 4 in Program 1.5) permits the termination 
comparison to be made with zero. Since zero often occurs in comparisons, 
it is convenient to omit it. Thus, if a variable stands alone at a branch 
point, comparison with zero is implied. Moreover, since a comparison on 
an index frequently occurs immediately after it is modified, a branch at the 
point of modification will denote branching upon comparison of the 
indicated index with zero, the comparison occurring after modification. 
Designing programs to execute decisions immediately after modification of 
the controlling variable results in efficient execution as well as notational 
elegance, since the variable must be present in a central register for both 
operations. 

Since the sequence of execution of statements is indicated by connecting 
arrows as well as by the order of listing, the latter can be chosen arbitrarily. 
This is illustrated by the functionally identical Programs 1.3 and 1.4. 
Certain principles of ordering may yield advantages such as clarity or 
simplicity of the pattern of connections. Even though the advantages of a 
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particular organizing principle are not particularly marked, the uniformity 
resulting from its consistent application will itself be a boon. The scheme 
here adopted ts called the method of leading decisions: the decision on each 
parameter 1s placed as early in the program as practicable, normally just 
before the operations indexed by the parameter. This arrangement groups 
at the head of each iterative segment the initialization, modification, and 
the termination test of the controlling parameter. Moreover, it tends to 
avoid program flaws occasioned by unusual values of the arguments. For 


k <-(A) +1 
Ci <0 


[| k<-k-I — 
8 Cj Cj + A, x BY 


Program 1.6 Matrix multiplication using leading decisions 


example, Program 1.6 (which is such a reorganization of Program 1.5) 
behaves properly for matrices of dimension zero, whereas Program 1.5 
treats every matrix as if it were of dimension one or greater. 

Although the labeled arrow representation of program branches 
provides a complete and graphic description, it is deficient in the following 
respects: (1) a routine translation to another language (such as a computer 
code) would require the tracing of arrows, and (2) it does not permit 
programmed modification of the branches. 

The following alternative form of a branch statement will therefore be 
used as well: 


This denotes a branch to statement number s, of the program if the relation 
wry holds, The parameters r and s may themselves be defined and re- 
defined in other parts of the program. The null element o will be used to 
denote the relation which complements the remaining relations r;; in 
particular, (©)- » (s), or simply - >s, will denote an unconditional branch to 
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statement s. Program 1,7 shows the use of these conventions in a reformu- 
lation of Program 1.6. More generally, two or more otherwise independent 
programs may interact through a statement in one program specifying a 
branch in a second. The statement number occurring in the branch must 
then be augmented by the name of the program in which the branch is 
effected. Thus the statement (») > Program 2.24 executed in Program | 
causes a branch to step 24 to occur in Program 2. 


J 
k<k—1,(7/, =) + (8,4 
CeO) tag ORs eT 


ee | 


2 i<i-i,(4, =)>@B,9) |= 
3 J (B) + I 

4 fof 1, C4, =) +6, 2) 

5 k «- oA) +1 

6 Cc, «-0 

7 

8 


Program 1.7. A reformulation of Program 1.6, using an algebraic 
statement of the branching 


One statement in a program can be modified by another statement 
which changes certain of its parameters, usually indices. More general 
changes in statements can be effected by considering the program Itself as a 
vector p whose components are the individual, serially numbered statc- 
ments. All the operations to be defined on general vectors can then be 
applied to the statements themselves. For example, the jth statement can 
be respecified by the ‘th through the occurrence of the statement p, <— p,. 

The interchange of two quantities y and wr (that is, x specifies y and the 
original value of y specifies x) will be denoted by the statement 4 « >.r. 


13 STRUCTURE OF THE LANGUAGE 


Conventions 


The Summary of Notation at the end of the book summarizes the nota- 
tion developed in this chapter. Although intended primarily for reference, 
it supplements the text in several ways. It frequently provides a more 
concise alternative definition of an operation discussed in the text, and it 
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also contains important but easily grasped extensions not treated explicitly 
in the text. By grouping the operations into related classes it displays 
their family relationships. 

A concise programming language must incorporate families of opera- 
tions whose members are related in a systematic manner. Each family will 
be denoted by a specific operation symbol, and the particular member of the 
family will be designated by an associated controlling parameter (scalar, 
vector, matrix, or tree) which immediately precedes the main operation 
symbol. The operand is placed immediately after the main operation 
symbol. For example, the operation & * » (left rotation of « by & places) 
may be viewed as the Ath member of the set of rotation operators denoted 
by the symbol *. 

Operations involving a single operand and no controlling parameter 
(such as Lx], or [2]) will be denoted by a pair of operation symbols which 
enclose the operand. Operations involving two operands and a controlling 
parameter (such as the mask operation /a, tt, b/) will be denoted by a pair 
of operation symbols enclosing the entire set of variables, and the con- 
trolling parameter will appear between the two operands. In these cases 
the operation symbols themselves serve as grouping symbols. 

In interpreting a compound operation suchas k *(/ 4) itis important to 
recognize that the operation symbol and its associated controlling param- 
eter together represent an indivisible operation and must not be separated. 
It would, for example, be incorrect to assume that / * (A , ¥) were 
equivalent toA * (j , ¥), although it can be shown that the complete opera- 
tions j | and k + do commute, that is, kA t(j, *) =f. (A 7%). 

The need for parentheses will be reduced by assuming that compound 
statements are, except for intervening parentheses, executed from right to 
left. Thus & fj , ¥ is equivalent tok *(/,¥*), notto (A Ts), *. 

Structured operands such as vectors and matrices, together with a 
systematic component-by-component generalization of elementary opera- 
tions, provide an important subordination of detail in the description of 
algorithms. The use of structured operands will be facilitated by se/ection 
operations for extracting a specified portion of an operand, reduction 
operations for extending an operation (such as logical or arithmetic 
multiplication) over all components, and permutation operations for 
reordering components. Operations defined on vectors are extended to 
matrices: the extended operation is called a row operation if the under- 
lying vector operation is applied to each row of the matrix and a column 
operation if it is applied to each column. A column operation Is denoted 
by doubling the symbol employed for the corresponding row (and vector) 
operation. 

A distinct typeface will be used for each class of operand as detailed in 
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Table 1.8. Special quantities (such as the prefix vectors @/ defined in See. 
1.7) will be denoted by Greck letters in the appropriate typeface. For 
mnemonic reasons, an operation closely related to such a special quantity 


Type of Representation 
Opernd Printed Typed 
Literal 
Alphabetic Roman, u.c. and Lc. Circled u.c. and I.c. roman. 
Numeric Standard numeral Standard numeral 
Variable 
Alphabetic Italic, u.c. and Lc. Unmarked 
Numeric Italic numeral Underscore 
Vector (.c. boldface italic Underscore 
Matrix u.c. boldface italic Underscore 
Tree u.c. boldface roman Wavy underscore 


Table 1.8 Typographic conventions for classes of operands 


will be denoted by the same Greek letter. For example, x/t denotes the 
maximum prefix (Sec. 1.10) of the logical vector u. Where a Greek letter 
is indistinguishable from a Roman, sanserif characters will be used, e.g., 
E and J for the capitals epsilon and iota. 


Literals and variables 


The power of any mathematical notation rests largely on the use of 
symbols to represent general quantities which, in given instances, are 
further specified by other quantities. Thus Program 1.4 represents a 
general process which determines x = 7° for any suitable value of 7. Ina 
specific case, say n = 27, the quantity x 1s specified as the number 9. 

Each operand occurring in a meaningful process must be specified 
ultimately in terms of commonly accepted concepts. The symbols 
representing such accepted concepts will be called /itera/s. Examples of 
literals are the integers, the characters of the various alphabets, punctua- 
tion marks, and miscellaneous symbols such as $ and %. The literals 
occurring in Program 1.4 are 0, 1, and 2. 

It is important to distinguish clearly between general symbols and 
literals. In ordinary algebra this presents little difficulty, since the only 
literals occurring are the integers and the decimal point, and each general 
symbol employed includes an alphabetic character. In describing more 
general processes, however, alphabetic literals (such as proper names) also 
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appear. Moreover, in a computer program, numeric symbols (register 
addresses) are used to represent the variables. 

In general, then, alphabetic literals, alphabetic variables, numeric 
literals, and numeric variables may all appear in a complex process and 
must be clearly differentiated. The symbols used for literals will be roman 
letters (enclosed in quotes when appearing in text) and standard numerals. 
The symbols used for variables will be italic letters, italic numerals, and 
boldface letters as detailed in Table 1.8. Miscellaneous signs and symbols 
when used as literals will be enclosed in quotes in both programs and text. 

It is sometimes desirable (e.g., for mnemonic reasons) to denote a 
variable by a string of alphabetic or other symbols rather than by a single 
symbol]. The monolithic interpretation of such a string will be indicated 
by the te used in musical notation, thus: jv, inv, and INV may denote 


Nee ee 
cy 


the variable “inventory,” a vector of inventory values, and a matrix of 
inventory values, respectively. 

In the set of alphabetic characters, the space plays a special role. For 
other sets a similar role is usually played by some one element, and this 
element is given the special name of null element. In the set of numeric 
digits, the zero plays a dual role as both null element and numeric quantity. 
The null element will be denoted by the degree symbol «. 

In any determinate process, each operand must be specified ultimately in 
terms of literals. In Program 1.4, for example, the quantity k is specified 
in terms of known arithmetic operations (multiplication and division) 
involving the literals | and 2. The quantity 7, on the other hand, is not 
determined within the process and must presumably be spectfied within 
some larger process which includes Program 1.4. Such a quantity is called 
an argument of the process. 


Domain and range 

The class of arguments and the class of results of a given operator are 
called its domain and range, respectively. Thus the domain and range of 
the magnitude operation (|z|) are the real numbers and the nonnegative 
real numbers, respectively. 

A variable is classified according to the range of values it may assume: 
it 1s logical, integral, or numerical, according as the range is the set of 
logical variables (that is, 0 and [), the set of integers, or the set of real 
numbers. Each of the foregoing classes is clearly a subclass of each class 
following it, and any operation defined on a class clearly applies to any of 
its subclasses. A variable which is nonnumeric will be called arbitrary. In 
the Summary of Notation, the range and domain of each of the operators 
defined is specified in terms of the foregoing classes according to the 
conventions shown in Sec. S.1}. 
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1.4 ELEMENTARY OPERATIONS 


The clementary operations employed include the ordinary arithmetic 
operations, the elementary operations of the logical calculus, and the 
residue and related operations arising in elementary number theory. In 
defining operations in the text, the symbol <=> will be used to denote 
equivalence of the pair of statements between which it occurs. 


Arithmetic operations 


The ordinary arithmetic operations will be denoted by the ordinary 
symbols +, —, X, and — and defined as usual except that the domain and 
range of multiplication will be extended slightly as follows. If one of the 
factors is a logical variable (0 or I), the second may be arbitrary and the 
product then assumes the value of the second factor or zero according as 
the value of the first factor (the Jogical variable) is | or 0. Thus if the 
arbitrary factor is the literal ‘‘q,” then 


Oxgq=qx0=0 
and Ixq=qxl=q. 


According to the usual custom in ordinary algebra, the multiplication 
symbol may be elided. 


Logical operations 


The elementary logical operations and, or, and not will be denoted by A, 

Vv, and an overbar and are defined in the usual way as follows: 
w<uAcv<>w=t ifand only if w= lande= tl, 
w<e-eu YVe<>w = 1 ifandonlyif w=1 orv=1], 
wou <>we=1 ifand only if uv =0, 

If x and y are numerical quantities, then the expression x << y implies 
that the quantity x stands in the relation “less than” to the quantity y. 
More generally, if « and # are arbitrary entities and # is any relation 
defined on them, the re/ational statement (%.#/3) is a logical variable which 
is true (equal to I) if and only if x stands in the relation .# to f. For 
example, if x is any real number, then the function 


(x > 0) — (2 < 0) 


(commonly called the sign function or sgn x) assumes the values |, 0, or 
—1 according as x is strictly positive, 0, or strictly negative. Moreover, 
the magnitude function |x| may be defined as |x] =x xX sgna =x x 


((x = 0) — (x < 0). 
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The relational statement is a useful generalization of the Kronecker 
delta, that is, 6,/ = ( = /). Moreover, it provides a convenient expression 
for a number of familiar logical operations. The exc/usive or, for example, 
may be denoted by (u + r), and its negation (i.e., the equivalence function) 
may be denoted by (uv = vr). 


Residues and congruence 


For each set of integers n, j, and 6, with b > 0, there exists a unique pair 
of integers g and r such that 


n=b¢tr, fjsr<jts. 


The quantity r is called the /-residue of n modulo b and is denoted by b ; a 
For example, 3|,9 = 0, 3|,9 = 3, and 3|, 10 = 1. Moreover, if # > 0, 
then 4 |, is the remainder obtained in dividing ” by b and q is the integral 
part of the quotient. A number # is said to be of even parity if its 0-residue 
modulo 2 is zero and of odd parity if 2 lo a=l. 

If two numbers # and m have the same /-residue modulo 4, they differ 
by an integral multiple of b and therefore have the same k-residue modulo 
bforanyk. Ifb| n= li m, then m and sare said to be congruent mod b. 
Congruency is transitive and reflexive and is denoted by 


m =n(mod Bb). 


In classical treatments, such as Wright (1939), only the 0-residue is 
considered. The use of l-origin indexing (cf. Sec. 1.5) accounts for the 
interest of the |-residue. 

A number represented in a positional notation (e.g., in a base ten or a 
base two number system) must, in practice, employ only a finite number of 
digits. It is therefore often desirable to approximate a number x by an 
integer. For this purpose two functions are defined: 


1. the floor of x (or integral part of .r), denoted by [x] and defined as the 
Jargest integer not exceeding +, 

2. the ceiling of x, denoted by [| and defined as the smallest integer not 
exceeded by x. 


Thus 
[3.14J,=4, [3.14)=3, [-3.14J = —4, 
[3.00] = 3, [3.00] = 3, [|—3.00] = —3. 


Clearly [x] = —|[—2x]and[z] <r < fe]. Moreover, = bl7 + 6] + 6 lo a 
for all integers #. Hence the integral quotient la + bl is equivalent to the 
quantity g occurring in the definition of the j-residue for the case j = 0. 
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Structured operands 


1.5 STRUCTURED OPERANDS 


Elementary operations 


Any operation defined on a single operand can be generalized to apply 
to each member of an array of related operands. Similarly, any binary 
operation (defined on two operands) can be generalized to apply to pairs 
of corresponding elements of two arrays. Since algorithms commonly 
incorporate processes which are repeated on each member of an array of 
operands, such generalization permits effective subordination of detail in 
their description. For example, the accounting process defined on the 
data of an individual bank account treats a number of distinct operands 
within the account, such as account number, name, and balance. More- 
over, the over-all process is defined on a large number of similar accounts, 
all represented in a common format. Such structured arrays of variables 
will be called structured operands, and extensive use will be made of three 
types, called vector, matrix, and tree. As indicated in Sec. S.1 of the 
Summary of Notation, a structured operand is further classified as /ogical, 
integral, numerical, or arbitrary, according to the type of elements it 
contains. 

A vector x is the ordered array of elements (4%, ¥, %3,... Xe). The 
variable x, is called the ith component of the vector x, and the number of 
components, denoted by +(x) (or simply » when the determining vector ts 
clear from context), is called the dimension of x. Vectors and their com- 
ponents will be represented in lower case boldface italics. A numerical 
vector x may be multiplied by a numerical quantity & to produce the 
scdlar multiple k X x (or kx) defined as the vector z such that z, =k X «,. 

All elementary operations defined on individual variables are extended 
consistently to vectors as component-by-component operations. For 
example, 


ZSX4 Vo>s, = H#,4+ V,, 
Z=ExNXXK VSsz =X; X Y,, 
ZSKX > Ys, = HK, TY, 
% = [s|<>2, = [x], 
w= A vV<>w, =U, A V,, 
w= («<< yw, = (4, -< Y,). 


Thus if * 


(1.0, 1,1) and y = (0, I 
x A y = (0,0, 1,0), and (x -< y) = (0. 1 


,1,0) then « + y = (1, 1, 2, 1), 


, 9, 0). 
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Matrices 


A matrix M is the ordered two-dimensional array of variables 


1 1 1 
M}, M,, ..., Mia 
2 2 2 
M2, My, ..., Mm 

M i(M) 
MK 7 ry Mim) 


The vector (M,', M,',..., M,") is called the ith row rector of M and is 
denoted by M’. Its dimension »(M) is called the row dimension of the 
matrix. The vector (M,!, M),..., M;") is called the jth co/umn rector 
of M and 1s denoted by M,. Its dimension u(M) is called the co/umn 
dimension of the matrix. 

The variable M,’ ts called the (7, /)th component or element of the matrix. 
A matrix and its elements will be represented by upper case boldface 
italics. Operations defined on each element of a matrix are generalized 
component by component to the entire matrix. Thus, if © is any binary 


Operator P=Mo N<>P; = M;'0ON,’. 


Index systems 


The subscript appended to a vector to designate a single component is 
called an index, and the indices are normally chosen as a set of successive 
integers beginning at I, that is, * = (4, ",...,4,). It is, however, 
convenient to admit more general j-origin indexing in which the set of 
successive integers employed as indices in any structured operand begin 
with a specified integer /. 

The two systems of greatest interest are the common |-origin system, 
which will be employed almost exclusively in this chapter, and the 0-origin 
system. The latter system is particularly convenient whenever the index 
itself must be represented in a positional number system and will therefore 
be employed exclusively in the treatment of computer organization in 
Chapter 2. 


1.6 ROTATION 


The /eft rotation of a vector * is denoted by & + x and specifies the 
vector obtained by a cyclical left shift of the components of « by k places. 
Thus if @=(1,2,3,4,5,6), and b=(c,a,n,d.y), then 24 a= 
(3, 4,5, 6,1,2),and3*b=8& * b=(d,y,c,a,n). Formally,* 


Zeeks x<>s, =H, where j = 7|,(/ + k). 


* Restating the relation in terms of the O-residue will illustrate the convenience of the 
]-residue used here. 
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Right rotation is denoted by k | x and is defined analogously. Thus 
z=k\ x<>d, = x, where j = v|,(/ — &). 


If A = 1, it may be elided. Thus ¢ b = (a, n, d, y.c). 
Left rotation is extended to matrices in two ways as follows: 


A<j* B<>A' =j,}t B 

C—k} B<>-C, =k, * B,. 
The first operation is an extension of the basic vector rotation to each row 
of the matrix and 1s therefore called row rotation, The second operation ts 


the corresponding column operation and is ‘therefore denoted by the 
doubled operation symbol *. For example, if 


k = (0, |, 2), 
and 
a b¢ 
B=|de f 
g hot 
then 
ab ¢ aoe tf 
ki B=le f d and kf} Ba=td ih ¢ 
i ogi ih g b f 


Right rotation is extended analogously. 


1.7 SPECIAL VECTORS 


Certain special vectors warrant special symbols. In each of the following 
definitions, the parameter # will be used to specify the dimension. The 
interval vector U(n) 1s defined as the vector of integers beginning with /. 
Thus 04) = (0, 1, 2, 3), o(4) = (1, 2, 3.4), and t (5) = (—7, —6. —5, 
—4, —3). Four types of logical vectors are defined as follows. The /th 
unit vector en) has a one in the jth position, that is, (€()), = (A = /). 
The full vector e(#) consists of all ones. The vector consisting of all zeros 
is denoted both by 0 and by €(#). The prefix vector of weight j is denoted 
by @/(7) and possesses ones in the first & positions, where & is the lesser of / 
and vn. The suffix vector wi() is defined analogously. Thus €°(3) = 
(0, 1, 0). €(4) = (1, 1, 1, 1), @°(S) = CI, 1, 1, 0.0), w3(5) = (0,0, 1, 1, 1). and 
a(5) = @(5) = (1, 1. 1,1, 1). Moreover, w'(n) = jt a(n), and a'(7) = 


Jy o'(2). 
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A logical vector of the form @”(7) A o3/(7) is called an infix vector. An 
infix vector can also be specified in the form / , @'(), which displays its 
weight and location more directly. 

An operation such as x A y is defined only for compatible vectors x and 
y, that is, for vectors of like dimension. Since this compatibility 
requirement can be assumed to specify implicitly the dimension of one of 
the operands, elision of the parameter # may be permitted in the notation 
for the special vectors. Thus, if y = (3, 4, 5, 6, 7), the expressions € x y 
and €/ x y imply that the dimensions of € and e/ are both 5. Moreover, 
elision of / will be permitted for the interval vector v(v) (or t’), and for the 
residue operator |; when / is the index origin in use. 

It is, of course, necessary to specify the index origin in use at any given 
time. For example, the unit vector €(5) is (0,0, 1,0,0) in a I-origin 
system and (0, 0,0, 1,0) in a O-origin system, even though the definition 
(that is, (€(7)), = (A =/)) remains unchanged. The prefix and suffix 
vectors are, of course, independent of the index origin. Unless otherwise 
specified, |-origin indexing will be assumed. 

The vector €(0) is a vector of dimension zero and will be called the su// 
vector. It should not be confused with the special null element ». 


1.8 REDUCTION 


An operation (such as summation) which is applied to all components 
of a vector to produce a result of a simpler structure is called a reduction. 
The “-reduction of a vector * is denoted by © /* and defined as 


ax /¥eos = (0 (Ce, Hy) 


where ‘> is any binary operator with a suitable domain. Thus +/* 1s the 
sum, X/x is the product, and v/# is the logical sum of the components of 
a vector x. For example, x/u'(S5)= 1 x 2x 3 x 4x 5, x/U(n) =a, 
and +/u(7) = n(n + 1)/2. 

As a further example, De Morgan’s law may be expressed as A/u = 
v fu, where # is a logical vector of dimension two. Moreover, a simple 
inductive argument (Exercise 1.10) shows that the foregoing expression is the 
valid generalization of De Morgan’s law for a logical vector u of arbitrary 
dimension. 

A relation -# incorporated into a relational statement (7-#7y) becomes, 
in effect, an operator on the variables x and y. Consequently, the reduction 
A|x can be defined in a manner analogous to that of ©/x, that is, 


Aix = (0+ + (8%) Ax.) AX) AF +++). #x,). 


The parentheses now imply relational statements as well as grouping. 
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The relational reductions of practical interest are -//u, and =/u, the 
exclusive-or and the equivalence reduction, respectively. 

The inductive argument of Exercise 1.10 shows that fu = 2 \y(+/u). 
For example, if u = (1,0, 1, [, 0), then 


//u= (C(t 4 0) 4 I) ¥ 1) 0) 
= (1 # 1) 4 1) #9) 
= ((0.41).40) 
=(1 40)=1, 


and 2|,(+/u) = 2|,3 = 1. Similarly, =/u = 2 |) (+/u), and as a con- 
sequence, 

fu = =u, 
a useful companion to De Morgan’s law, 

To complete the system it is essential to define the value of (/e(0), the 
reduction of the null vector of dimension zero, as the identity element of 
the operator or relation ©. Thus +/e(0) = v/e(0) = 0, and x/e(0) = 
A /e(O) = I. 

A reduction operation is extended to matrices in two ways. A row 
reduction of a matrix X by an operator “> is denoted by 


yo ofX 
and specifies a vector y of dimension 4(X) such that y, = 0/X". A 
column reduction of X is denoted by z <— ©//X and specifies a vector z of 


dimension »(X) such that z, = ©/X,. 
For example, if 


1 0 ft 0 
U={0 01 1 
t 1 1 0 


then +/U = (2, 2,3), +//U = (2.1.3.1), A//U = (0.0, 1,0), “/U = 
(0,0, 1), =//U = (0, t, 1, 1), and +/(=//U) = 3. 


1.9 SELECTION 


Compression 


The effective use of structured operands depends not only on generalized 
operations but also on the ability to specify and select certain elements or 
groups of elements. The selection of single elements can be indicated by 
indices, as in the expressions v,, M’, M,, and M,’. Since selection is a 
binary operation (i.e., to select or not to select), more general selection is 
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conveniently specified by a logical vector, each unit component indicating 
selection of the corresponding component of the operand. 

The selection operation defined on an arbitrary vector a and a compat- 
ible (i.e., equal in dimension) logical vector u is denoted by ¢ <— u/a@ and 
is defined as follows: the vector ¢ is obtained from a by suppressing from 
a each component a, for which u, = 0. The vector u is said to compress 
the vector a. Clearly »(¢) = +/u. For example, if u = (1,0, 0,0, 1, 1) 
and a = (M,0,n,d, a, y), then u/a = (M, a, y). Moreover, if # is even 
and v = (2e)|,U(7) = (1, 0, 1,0, 1,...), then v/l(n) = (1, 3, 5,...,.47 — 1), 
and +/(v/t'(a)) = (n/2). 

Row compression of a matrix, denoted by u/A, compresses each row 
vector A’ to form a matrix of dimension «(A) x +/u. Column compres- 
sion, denoted by u//A, compresses each column vector A, to form a 
matrix of dimension +/u x (A). Compatibility conditions are »(u) = 
v(A) for row compression, and »(u) = u(A) for column compression. 
For example, if A is an arbitrary 3 x 4 matrix, uw = (0, 1,0, 1) and 
v = (1,0, 1); then 


A) Aj ‘ 
i 2 A,’ Aj’ Aj) Ay 
u/A=|[ A,” AP], v//A= : : ; i; 
; ; Ay A,” A; Ay 
A; A? 
A, Ay 
and u/v//A = v//u/A = 


It is clear that row compression suppresses columns corresponding to 
zeros of the logical vector and that column compression suppresses rows. 
This illustrates the type of confusion in nomenclature which is avoided by 
the convention adopted in Sec. 1.3: an operation ts called a row operation 
if the underlying operation from which it is generalized is applied to the 
row vectors of the matrix, and a co/umn operation ifit is applied to columns. 


Example 1.1. A bank makes a quarterly review of accounts to produce the 
following four lists: 


1, the name, account number, and balance for each account with a balance 
less than two dollars. 


te 


. the name, account number, and balance for each account with a negative 
balance exceeding one hundred dollars. 


3, the name and account number of each account with a balance exceeding 
one thousand dollars. 


4. all unassigned account numbers. 
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The ledger may be described by a matrix 
L!} 


L= (1), L,, Ly) = 


be 


with column vectors £,, Ly, and L, representing names, account numbers, and 
balances, respectively, and with row vectors Y BL Deemer BL representing 
individual accounts. An unassigned account number is identified by the word 
“none” in the name position. The four output lists will be denoted by the 
matrices P, Q, R, and S, respectively. They can be produced by Program 1.9. 


Program 1.9, Since LE, is the vector of balances, and 2€ is a compatible vector 
each of whose components equals two, the relational statement (Ls - 2€) defines 
a logical vector having unit components corresponding to those accounts to be 


[ 
1 — P<(L, <26)//L L Bank ledger. 
2 Q <(L, < —1006)//L Lh kth account. 
L,! 
3 R «(Ly > 1000 ©)//e2/L 3 Balance of Ath account. 
L, Account number of kth 
4 S <—(L, = none e)//e?/L | account. 
L, Name of kth account or 
“*‘none”’ if account number 
D,* unused, 


Legend 
Program 1.9 Selection on bank ledger L (Example 1.1) 


included in the list P. Consequently, the column compression of step I selects 
the appropriate rows of LZ to define P. Step 2 is similar, but step 3 incorporates 
an additional row compression by the compatible prefix vector a? = (1, 1, 0) to 
select columns one and two of L. Step 4 represents the comparison of the name 
(in column £,) with the literal ‘none,’ the selection of each row which shows 
agrecment, and the suppression of all columns but the second. The expression 
“none €° occurring in step 4 illustrates the use of the extended definition of 
multiplication. 


Mesh, mask, and expansion 


A logical vector w and the two vectors @ = u/¢ and b = u/c, obtained 
by compressing a vector c, collectively determine the vector c. The 
Operation which specifies ¢ as a function of a, b, and tis called a mesh and 
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is defined as follows: If @ and 6 are arbitrary vectors and if uw is a logical 
vector such that +/u = (a) and +/u = 9(b), then the mesh of a and b on 
u is denoted by ‘a, u, 6) and is defined as the vector ¢ such that u/c = a 
and u/c = b. The mesh operation is equivalent to choosing successive 
components of ¢ from a4 or Bb according as the successive components of u 
areQor |. If. forexample, a = (s, e.k). b = (t, a), and u = (0. 1,0, 1, 0), 


then ‘a, u,b: = (s,t.e.a,k). As a further example, Program 1.10a 
1 —| i <0 1) u-(Gelt) #0) 
2 j —_— 0 2 c _— \a,u,b\ 
k<0 
; 7 (6) 
4 — &k: oa) +b) 
5 k<-k+1 1 
6 3k 1 = a, b | Given vectors. 
7 joj+i c C = (44,B,,B5,42,b3,B4,43, . . »). 
b i Index of a. 
8 ee 05 ] Index of b. 
9 iei+l k Index of e. 
10 '— Gx Gs; u u = (0,1,1,0,1,1,0, .. .). 
(a) 


Legend 


Program 1.10 Interfiling program 


(which describes the merging of the vectors a and b, with the first and every 
third component thereafter chosen from 4) can be described alternatively 
as shown in Program 1.10/. Since U = (1, 2,3.4.5,6,...), then 
(3€) |gul = (1, 2, 0, 1, 2, 0,...), and consequently the vector u specified by 
step | is of the form uw = (0,1, 1,0. 1. 1,0,...). 

Mesh operations on matrices are defined analogously, row mesh and 
column mesh being denoted by single and double reverse virgules, respec- 
tively. 

The catenation of vectors *, y,....% is denoted by x Dy Qe: Oz 
and ts defined by the relation 


Nay Dy. = 
FOV @%= (%, 0, ¥exyr Mis Var +s Bcxy)- 


Catenation is clearly associative and for two vectors x and y it is a special 
case of the mesh \x, wu, y\ in which w is a suffix vector. 
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In numerical vectors (for which addition of two vectors is defined), the 
effect of the general mesh operation can be produced as the sum of two 
meshes, each involving one zero vector. Specifically, 


‘a, uly = ix, u,0\ + \0, uy 
= 0, u,¥, + \0, uw, y\. 


The operation \0, uw. y\ proves very useful in numerical work and will be 
called expansion of the vector y, denoted by u\y. Compression of uy by 
wand by # clearly yield y and 0, respectively. Moreover, any numerical 
vector ¥ can be decomposed by a compatible vector wu according to the 
relation 

x= uiu/x + u\u/x, 


The two terms are vectors of the same dimension which have no nonzero 
components incommon. Thus if a = (1,0, 1,0, 1), the decomposition of 
¥ appears as 

¥ = (0, ¥, 0, ¥y, 0) + (4), 0, ¥5, 0, *5). 


Row expansion and column expansion of matrices are defined and 
denoted analogously. The decomposition relations become 


X = wu/X + u\u/X, 
and X =w\ul/X + u\\ul/X. 


The mask operation is defined formally as follows: 
c«fa,u,b/<2u/e =u/a, and ufc = ulb. 


The vectors c, a, u, and 6 are clearly of a common dimension and c, = a, 
or 6, according as u, = 0 or u, = |. Moreover, the compress, expand, 
mask, and mesh operations on vectors are related as follows: 


/a,u, bf = \n/a, u, u/b\, 


‘a, u, b\ = Ju\a, u, u\b/. 


Analogous relations hold for the row mask and row mesh and for the 
column mask and column mesh. 

Certain selection operations are controlled by logical matrices rather 
than by logical vectors. The row compression U/A selects elements of A 
corresponding to the nonzero elements of U. Since the nonzero elements 
of U may occur in an arbitrary pattern, the result must be construed as a 
vector rather than a matrix. More precisely, U/A denotes the catenation 
of the vectors U’/A’ obtained by row-by-row compression of A by U. 
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The column compression U//A denotes the catenation of the vectors U,/A,. 
If, for example, 


O11 01 1 

U=]1 1 0 0 0 

Oo !t 1 0 0 
then U/A = (A,!, Ay’, A;', AZ, A,2, A,3, Ay), 
and U//A = (A,2, Ay!, A, Ay3, As3, Ay), A5!). 


Compression by the full matrix E (defined by E = 0) produces either a 
row list (E/A) or a column list (E//A) of the matrix A. Moreover, a numeri- 
cal matrix X can be represented jointly by the logical matrix U and the 
row list U/X (or the column list U//X), where U = (X / Q). Ifthe matrix 
X is sparse (i.e., the components are predominantly zero), this provides a 
compact representation which may reduce the computer storage required 
for X. 

The compression operations controlled by matrices also generate a group 
of corresponding mesh and mask operations as shown in Sec. 8.9. 


1.10 SELECTION VECTORS 


The logical vector u involved in selection operations may itself arise in 
various ways. It may be a prefix vector #/, a suffix wv’, or an infix (/ | a’); 
the corresponding compressed vectors a//*, w’/x, and (/ | @/)/* are called 
a prefix, suffix, and infix of x, respectively. 

Certain selection vectors arise as functions of other vectors, e.g., the 
vector (* °:: 0) can be used to select all nonnegative components of *, and 
(b /% *€) serves to select all components of b which are not equal to the 
literal “‘*.” Two further types are important: the selection of the longest 
unbroken prefix (or suffix) of a given logical vector, and the selection of the 
set of distinct components occurring in a vector. The first is useful in left 
(or right) justification or in a corresponding compression intended to 
eliminate leading or trailing “filler components” of a vector (such as left 
zeros in a number or right spaces in a short name). 

For any logical vector u, the maximum prefix of u is denoted by z/u and 
defined as follows: 

va glucry =a, 


where j is the maximum value for which A /(a//u) = 1. The maximum 
suffix is denoted by ™/u and is defined analogously. If, for example, 
u = (1,1, 1,0, 1, 1, 0,0, 1, 1), then = a/t = (1, 1, 1, 0, 0, 0, 0, 0, 0, 0), 
m/u = (0, 0, 0, 0, 0, 0,0, 0,1, 1), +/a/u = 3, and +/m/u = 2, 
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The leading zeros of a numerical vector x can clearly be removed either 
by compression: 
y< (4[(x = 0) /*, 


or by left justification (normalization): 
z< (+/x/(~¥ = 0)) tx, 


The extension of the maximum prefix operation to the rows of a logical 
matrix U is denoted by x/U and defined as the compatible logical matrix 
V. such that V’ = 2/U’. The corresponding maximum column prefix 
operation is denoted by z//U. Right justification of a numerical matrix X 
is achieved by the rotation k y X, where k = +/m/(X = 0), and top 
justification is achieved by the rotation (+//z//(X = 0)) 4 X (sce See. S.6.) 

A vector whose components are all distinct will be called an ordered set. 
The forward set selector on b is a logical vector denoted by o/b and defined 
as follows: the statement v <— o/b implies that v; = 1 if and only if 6, 
differs from all preceding components of b. Hence v/b is a set which 
contains all distinct components of 6, and +/v/t is a minimum. For 
example, if e = (C, a, n, a, d, a), then (a/¢)/e = (C, a, n, d) is a list of the 
distinct letters in ¢ in order of occurrence. Clearly (o/b)/b = b if and only 
if 6 is a set. 

The backward set selector 7/b is defined analogously (c.g., (r/ey/e = 
(C.n, d,a)). Forward and backward set selection are extended to matrices 
by both rows (0/B, and 7/B) and columns (o//B, and 7//B) in the estab- 
lished manner. 


1.110 THE GENERALIZED MATRIX PRODUCT 


The ordinary matrix product of matrices X and Y is commonly denoted 
by XY and defined as follows: 


y(X) ; i =| 
ZOoXVsrZi= YX x VR 
hod J 


I 
is in) 
= 


Jt can be defined alternatively as follows: 
(XY) = +/(X' x ¥)). 


This formulation emphasizes the fact that matrix multiplication incorpor- 
ates two elementary operations (+, x) and suggests that they be displayed 
explicitly. The ordinary matrix product will therefore be written as 
XY, 
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More generally, if ©, and ©, are any two operators (whose domains 
include the relevant operands), then the generalized matrix product 
X .' ¥Y is defined as follows: 


(K EY) = O,f(X'O,Y)), = ee 
For example, if rae 
{ 3 2 0 
A=[2 1 0 1 and B= ce 
400 2 eae 
2 0 
4 14 0 | 
then AiB=|10 5], AL B=]0 Of, 
20 4 1 0 
1 0 4 6 
A> B={1 1], and (A-~0)+ B= ]6 4 
(ae 6 I 


The generalized matrix product and the selection operations together 
provide an elegant formulation in several established areas of mathe- 
matics. A few examples will be chosen from two such areas, symbolic 
logic and matrix algebra. 

In symbolic logic, De Morgan’s laws (A/u = v/u and =/u = / /u) 
can be applied directly to show that 

Ui; V=U-V. 

In matrix algebra, the notion of partitioning a matrix into submatrices of 
contiguous rows and columns can be generalized to an arbitrary parti- 
tioning specified by a logical vector u. The following easily verifiable 
identities are typical of the useful relations which result: 


X.Y = (u/X) < (a//¥) + (u/X) % (u]/¥), 
uf(X > Y)= X ~% (u/Y), 
uj|(X 2 Y) = (uj/X) # Y. 


The first identity depends on the commutativity and associativity of the 
operator + and can clearly be generalized to other associative commu- 
tative operators, such as A, V, and :#. 

The generalized matrix product applies directly (as does the ordinary 
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matrix product X , Y)to vectors considered as row (that is, | x #) or as 
column matrices. Thus: 


z<— Xl yse a, = 6 (XO, y), 
z<y | X<>3,= Cafty Cy X)), 


z<y- ‘wz = O1/(y ©, *). 


The question of whether a vector enters a given operation as a row 
vector or as a column vector is normally settled by the requirement of 
conformability, and no special indication is required. Thus y enters as 
a column vector in the first of the preceding group of definitions and as 
a row vector in the Jast two. The question remains, however, in the case 
of the two vector operands, which may be considered with the pre-operand 
either as a row (as in the scalar product y | *) or as a column. The 
latter case produces a matrix Z and will be denoted by 


Z<y Og x, 


where Z,! = y, 9%; w(Z) = ry), and o(Z) = r(*).* For example, if 
each of the vectors indicated is of dimension three, then 


Mi Ya Y3 Yi Vis V1 
ELV= TN. Y. Ms YE = TI. J, Io fs 
VY Yo Ms Ys. Ya. Vs 
1 t 0 


a3)? a3)= ft 1 0 


1.12 TRANSPOSITIONS 


Since the generalized matrix product is defined on columns of the 
post-operand and rows of the pre-operand, convenient description of 
corresponding operations on the rows of the post-operand and columns 
of the pre-operand demands the ability to transpose a matrix B, that ts, to 
specify a matrix C such that C,/ = B,’. In ordinary matrix algebra this 
type of transposition suffices, but in more general work transpositions 


* Since cach “vector” y; OQ, x, is of dimension one, no scan operator ©, is required, 
and the symbol @ may be interpreted as a “null” scan, 
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about either diagonal and about the horizontal and the vertical are also 
useful. Each of these transpositions of a matrix B is denoted by a superior 
arrow whose inclination indicates the axis of the transposition. Thus: 


: : : ) 
C<-B Cy) = B; 
C<-B CPS BE pat 8, pnt) 
C—B Ci = BY jf = 1,2,...,(B) 
C<-B C; = badass 


a 


For a vector ¥, either x or * will denote reversal of the order of the com- 


» 
ponents. For ordinary matrix transposition (that is, B), the commonly 
used notation B will also be employed. 

Since transpositions can effect any one or more of three independent 
alternatives (i.e., interchange of row and column indices or reversal of 
order of row or of column indices), repeated transposition can produce 
eight distinct configurations. There are therefore seven distinct transforma- 
tions possible; all can be generated by any pair of transpositions having 
nonperpendicular axes.* 


1.13) SPECIAL LOGICAL MATRICES 


Certain of the special logical vectors introduced in Sec. 1.7 have usefu 
analogs in logical matrices. Dimensions will again be indicated in paren- 
theses (with the column dimension first) and may be elided whenever the 
dimension is determined by context. If not otherwise specified, a matrix Is 
assumed to be square. 

Cases of obvious interest are the fu// matrix E(m x n), defined by 
Em xX n) = 0, and the identity matrix Km x n), defined by 1° = (= /). 
More generally, superdiagona/l matrices "Kim x n) are defined such that 
‘Vitnm xn) = (j= i +k), for k =O. Clearly %=1. Moreover, for 
square matrices “fy ‘P= (OOD, 

Four triangular matrices will be defined, the geometrical symbols 
employed for each indicating the (right-angled isosceles) triangular area of 


* These transpositions generate the rotation group of the square [cf. Birkhoff and 
MacLane (1941) Chap. VI]. A pair of transpositions commute If and only if their axes 
are perpendicular. Hence the pair <- and t may be written unambiguously ast . 
Morcover, < =". The remaining two transformations can be denoted by,and&., 
with the convention that the operator nearest the operand (i.e., the horizontal) ts 
executed first. 


§1.14 Polynomials and positional number systems 27 


the m x n rectangular matrix which is occupied by ones. Thus 
- - i a 

C— (mk ny<> CC; 
’ oe . i . 
C<— "(mm X N)<> Ch yy; forfi=1,2,... mI 
andj=,2,...,4. 


! 
| 
| 
: water | SETS: min (a, 1)) 
C<-.(m xX nj<> Ct | 


C— (mm X ay<> CFT | 
The use of the matrices E and I will be illustrated briefly. The relation 
ui v= 2|)(u { v) can be extended to logical matrices as follows: 


U* V = (2E)|,)(U x V): 


the trace of a square numerical matrix X may be expressed as t = +/I/X, 
The triangular matrices are employed in the succeeding section. 


1.14 POLYNOMIALS AND POSITIONAL NUMBER 
SYSTEMS 


Any positional representation of a number nv in a base 6 number system 
can be considered as a numerical vector * whose base 6 value is the quantity 
n= ws, where the weighting rector w is defined by w= (A | 
br? bb! 1). More generally, * may represent a number in a 
mixed-radix system in which the successive radices (from high to low order) 
are the successive components of a radix vector y. 

The base y value of * is a scalar denoted by y_\_ * and defined as the 
scalar product y | * = wx, where w =i") 7 y is the weighting vector. 
For example, if y = (7, 24, 60, 60) is the radix vector for the common 
temporal system of units, and if* = (0, 2, 1, 18) represents elapsed time in 
days, hours, minutes, and seconds, then 


t= w tx = (86400, 3600, 60, 1) = (0, 2, 1, 18) = 7278 


is the elapsed time in seconds, and the weighting vector w is obtained as 
the product 


O11 1 7 x [(24, 60, 60) 86400 
_-._ fo ot tf. [24) — J x/60, 60) _ {| 3600 
10 0 01 60} | x 60) - 60 
00 0 0 60 x /e(0) 


If 6 is any integer, then the value of x in the fixed base 6 is denoted by 
(be) | x. For example, (2e) | «°(5) = 24. More generally, if y is any real 
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number, then (ye) | # is clearly a polynomial in y with coefficients ,, 
Xo,...,%,, that is, 


(ye) Le = ey"? Fo te ay + 8, 
Writing the definition of y | * in the form 


yess r ye 
exhibits the fact that the operation {| is of the double operator type. Its 
use in the generalized matrix product therefore requires no secondary scan 
operator. This will be indicated by a null placed over the symbol _. Thus 


ZeXPYoZi= Xi! ¥,. 


3 


For example, (ye) ? X represents a set of polynomials in y with coeffi- 
cients X,, X,,..., X,, and Yj represents a set of evaluations of the 
vector * in a set of bases Y!, Y2,..., Y" 


1.15 SET OPERATIONS 


In conventional treatments, such as Jacobson (1951) or Birkhoff and 
MacLane (1941), a set is defined as an unordered collection of distinct 
elements. A calculus of sets is then based on such elementary relations as 
set membership and on such elementary operations as set intersection and 
set union, none of which imply or depend on an ordering among members 
ofa set. In the present context it is more fruitful to develop a calculus of 
ordered sets. 

A vector whose components are all distinct has been called (Sec. 1.10) 
an ordered set and (since no other types are to be considered) will hereafter 
be called a set. In order to provide a closed system, all of the “‘set opera- 
tions” will, in fact, be defined on vectors. However, the operations will, in 
the special case of sets, be analogous to classical set operations. The 
following vectors, the first four of which are sets, will be used for tlustra- 
tion throughout. 


I 


n, o, n, s, e, t) 


( 

( 
s= ( 
d = (d, u, s, k) 
n=( 

( 


r,e.d, u, n.d, a, n, t) 


A variable = is a member of a vector ¥ if z = ¥, for some /. Membership 
is denoted by ze x. A vector ¥ includes a vector y (denoted by either 
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x > yor yC x) if each element y, is a member of ¥. If both x > y and 
x © y, then x and y are said to be similar. Similarity of « and y is denoted 
by *« =y. Forexample,t Co s,fortcaact, t=a,andt / 1. If 
xc yandx:7 y, then xis strictly included in y. Strict inclusion is denoted 
byxc y. 

The characteristic vector of * on y is a logical vector denoted by e,*, and 
defined as follows: 


u = €,*<->9(u) = r(y), and u, = (y, € 4). 


For example, e,’ = (0, 1, 1, 1, 0),e,° = (1, 1, 1), e,7 = (1, 0, 0, 0, 1), ey = 
(1, 0, 1,0), and e,,” = (1,0, 1, 0, 1, 1). 
The intersection of y with ¥ is denoted by y M x, and defined as follows: 


yor=e,*ly. 


For example, s Vd =(s,d), ds = (d,s), sOr=(a,t,e,d) and 
rms =(e,d,d,a,t) Clearly, « Ny =y Ox, although x A yis not, in 
general, equal to y M *, since the components may occur in a different 
order and may be repeated a differing number of times. The vector 
x © yis said to be ordered on x. Thus ais ordered ons. If xand y contain 
no common elements (that is, (¥ © y) = €(0)), they are said to be disjoint. 

The set difference of y and x is denoted by y A « and defined as follows: 


yArs €,"/Y. 


Hence y A x is obtained from y by suppressing those components which 
belong to x. For example, €,’ = (1, 0.0, 0, 1) and s Aft = (s, d). More- 
over, €,° = (0, 0,0) and tA s = €(0). 

The wnion of y and ¥ Is denoted by y Ux and defined as follows:* 
yUx=yQ(xAy). Forexample,s Ud = (s,a.t.e,d.u,k),d Us = 
(d,u,s,k,a,t,e) sUVUa@=s Ut=s, and nm Ut=(n,0,n,5,¢,t, a). 
In general x Uy =y Uw, and x =(x Oy) U(w Ay). If ¥ and y are 
disjomt, their union ts equivalent to their catenation, that ts, * Oy = €(0) 
implies thatx Uy =a yy. 

In the foregoing development, the concepts of inclusion and similarity 
are equivalent to the concepts of inclusion and equality in the conventional 
treatment of (unordered) sets. The remaining definitions of intersection, 
difference, and union differ from the usual formulation in that the result of 
any of these operations on a pair of ordered sets is again an ordered set. 
With respect to similarity, these operations satisfy the same identities as do 
the analogous conventional set operations on unordered sets with respect 
to equality. 

* The symbols U and (and the operations they denote) are commonly called cup 
and cup, respectively. 
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The forward selection o/b and the backward selection 7/b defined in 
Sec. 1.10 can both be used to reduce any vector b to a similar set, that is, 


(a/b)/b = (7/b)/b = b. 


Moreover, if f = (a/x)/x, g = (a/y)/y, and h = (o/z)/z, then x = y Nz 
implies that f = g Oh, and x = y Uz implies that f= g UA. 

The unit vector €/(7) will be recognized as a special case of the charac- 
teristic vector €,* in which * consists of the single component /, and 
y =u (n), where /r is the index origin in use. In fact, the notation e/, can be 
used to make explicit the index origin / assumed for e’. 

If z is any vector of dimension two such that 2, « * and 2, € y, then 2 is 
said to belong to the Cartesian product of * and y. Thus if * = (a, b, c) 
and y = (0, 1), the rows of the matrix 


a 


oct & 
-_- Oo - Oo = © 


are a complete list of the vectors z belonging to the product set of « and 
y. The matrix A will be called the Cartesian product of ¥ and y and will be 
denoted by ¥ ) y. 

The foregoing definition by example will be formalized in a more 
general way that admits the Cartesian product of several vectors (that is, 
uv (4+++()y) which need not be sets, and which specifies a unique 
ordering of the rows of the resulting matrix. Consider a family of vectors 
xox?) 2.2. x of dimensions dj, dy,...,d,. Then 


c ’ lid (k p “ 
A<~x BHO: -@K Sa ( Oe (Gb, Shy nis oes 


for all vectors k such that | < k, ~~ d;. Clearly »(A) = s, and (A) = 
x/d. As illustrated by Table 1.11, the rows of the Cartesian product A are 
not distinct if any one of the vectors %’ is not a set. 
If the vectors x’ are all of the same dimension, they may be considered 
as the columns of a matrix X, that is, X; = *’. The product #! 6 x? 
Ox = X, © X, +++ @ X, may then be denoted by ©/X, or 
alternatively by «//¥, where Y is the transpose of X. For example, if 


Re 
[eegincl 


then ~/X is the matrix of arguments of the truth table for three variables. 


X = (2) ° €(33) = | 
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a # 0 
a # 1 
a x QO 
a « | 
xl == (a, b, a) b # 0 
x? = (#, *) A={[b¥# 1 
x3 = (0, 1) b « O 
d = (3, 2,2) b « 1 
a # 0 
a # 1 
a_*« O 
ao* | 


Table 1.11 The Cartesian product A = x! x2. x8 


1.16 RANKING 


The rank or index of an element ce B is called the B index of ¢ and is 
defined as the smallest value of / such that c = b,. To establish a closed 
system, the B index of any element a ¢ 6 will be defined as the null character 
o, The b index of any element ¢ will be denoted by bic; if necessary, the 
index origin in use will be indicated by a subscript appended to the 
operator. Thus, if b = (a, p.e), Ba p = 1, and ba, p = 2. 

The b index of a vector c is defined as follows: 


k<—bic<>k,=bic, 


- 
The extension to matrices may be either row by row or (as indicated by a 
doubled operator symbol ¢) column by column, as follows: 
J<—BiC«<>J' = BC, 
K~— BuC<>K, = Bt C,. 
Use of the ranking operator in a matrix product requires no secondary 
scan and is therefore indicated by a superior null symbol. Moreover, 


since the result must be limited to a two-dimensional array (matrix), either 
the pre- or post-operand is required to be a vector. Hence 


J<« Bireo<>J' =B' re, 
K<b, C<>K, = bt C,. 


The first of these ranks the components of ¢ with respect to each of a set 
of vectors B', B*,..., B", whereas the second ranks cach of the vectors 
| OF ORE C,, with respect to the fixed vector b. 


32 The language §1.16 


The use of the ranking operation can be illustrated as follows. Consider 
the vector b = (a, b, c,d, e) and the set of all 3° three-letter sequences 
(vectors) formed from its components. If the set is ordered lexically, and 
if « is the ith member of the set (counting from zero), then 


j= (bye) |. (8 ty #). 
For example, if x = (c, a, b), then (6 t *) = (2, 0, 1), and 7 = 51. 


1.17 MAPPING AND PERMUTATIONS 


Reordering operations 


The selection operations employed thus far do not permit convenient 
reorderings of the components. This is provided by the mapping operation 
defined as follows:* 

C< AC; = a, . 


For example, if @ = (a, b,...,z) and k = (6, 5, 4), then c = (f, e, d). 
The foregoing definition is meaningful only if the components of k each 
lie in the range of the indices of @, and it will be extended by defining a, as 
the null element © if 7 does not belong to the index set of a. Formally, 
Qn. if m,«€ U(»(a)) 
C— 4,450, = 
° if m, ¢ UQ(a)). 
The ability to specify an arbitrary index origin for the vector a being 
mapped is provided by the following alternative notation for mapping: 


if m, € U((a)) 
c<m j, asc, = 


\9 if m, ¢ V(r(a)), 


an t 


where j-origin indexing is assumed for the vector a. For example, if a4 
is the alphabet and m = (5, 0, 0, 4, 27, o, 3), then c = mya = (fo, o, e, 
o,0, d), and (c¢ :/ c€)/c = (f,e, d). Moreover, m{,a@ = (d,9, 9, ¢, Z, 0, b). 
Elision of / is permitted. 

If ac b, and m= bi,a, then clearly m{,b =a. If a ¢ b, then 
m j, b contains (in addition to certain nulls) those components common to 
b and a, arranged in the order in which they occur in a. In other words, 


(m <4 c€)/(mJ,b) =a b, 


* For the purposes of describing algorithms, this notation is superior to the classical 
“disjoint cycles” notation for permutations [cf. Birkhoff and MacLane, (1941)] because 
(1) the direction of the transformation (from a4 to ¢) is unequivocally indicated, and (2) 
the notation directly indicates a straightforward and efficient method for actual execu- 
tion, namely, indirect addressing. 
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Consequently, if p, g,...,¢ are vectors, each contained in b, then each 
can be represented jointly by the vector b and a mapping vector. If, for 
example, b is a glossary and p, q, etc., are texts, the total storage required 
for b and the mapping vectors might be considerably less than for the 
entire set of texts. 

Mapping may be shown to be associative, that is, m' J, (m?[,@a) = 
(m' |, m?){,a. Mapping is not, in general, commutative. 

Mapping is extended to matrices as follows: 


A~<~M{|, B<> A’ = M'J, B’, 
C— M\j, Be> C, 7 Mj, B,. 


Row and column mappings are associative. A row mapping 1M and a 
column mapping ?M do not, in general, commute, but do if all rows of 
1M agree (that is, 'M =e ° p), and if all columns of ?M agree (that is, 
°"M =q° €). The generalized matrix product is defined for the cases 


m fA, and M{a. 

The alternative notation (that is, ¢ = a,,), which does not incorporate 
specification of the index origin, is particularly convenient for matrices and 
is extended as follows: 


A< B™ <> Ai = BM, 


A<- B,, <> A; = B,,,. 
Permutations 


A vector k of dimension v is called a j-origin permutation vector if 
k =v(n). A permutation vector used to map any set of the same dimen- 
sion produces a reordering of the set without either repetition or suppres- 
sion of elements, that is, k ;@ = a@ for any set a of dimension r(k). For 
example, if a = (f, 4, *, 6,z), and k = (4, 2,5, 1, 3), then kJ, a = (6, 4, 
z, f, *). 

If p is an /A-origin permutation vector and q is any /-origin permutation 
vector of the same dimension, then q |, p is an /-origin permutation vector. 

Since 

V(r(a)){,;a = a, 


the interval vector V(1) will also be called the ;-origin identity permutation 
vector. If p and q are two j-origin permutation vectors of the same 
dimension n and if q|,; p = v(m), then p |; q = U(n) also and p and q are 
said to be inverse permutations. If p is any /-origin permutation vector, 
then q = pu,U is inverse to p. 

The rotation operation & * * is a special case of permutation. 
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Function mapping 


A function f which defines for each element b,; of a set b a unique 
correspondent a, in a set a is called a mapping from b to a. If f(b,;) = a,, 
the element 6, is said to map into the element a,. If the elements /(b,) 
exhaust the set a, the function fis said to map 6 onto a. If b maps onto a 
and the elements /(6;) are all distinct, the mapping is said to be one-to-one 
or biunique. In this case, (a) = »(b), and there exists an inverse mapping 
from a to 6 with the same correspondences. 

A program for performing the mapping f from b to a must therefore 
determine for any given element b « b, the correspondent ae a, such that 
a = f(b). Because of the convenience of operating upon integers (e.g., 
upon register addresses or other numeric symbols) in the automatic 
execution of programs, the mapping is frequently performed in three 
successive phases, determining in turn the following quantities: 


1. the index i = beh, 
2. the index & such that a, = f(8,), 
3. the element a,. 


The three phases are shown in detail in Program 1.12a. The ranking is 
performed (steps 1-3) by scanning the set b in order and comparing each 
element with the argument 6. The second phase is a permutation of the 
integers 1, 2,..., (b), which may be described by a permutation vector 
j, such that j; = &. The selection of j, (step 4) then defines &, which, in 
turn, determines the selection of a, on step 5. 


Example 1.2. If 
b = (apple, booty, dust, eye, night), 


a = (Apfel, Auge, Beute, Nacht, Staub) 


are, respectively, a set of English words and a set of German correspondents (both 
in alphabetical order), and if the function required is the mapping of a given 
English word 4 into its German equivalent @ according to the dictionary corre- 
spondences: 

English: apple booty dust eye night 

German: Apfel Beute Staub Auge Nacht 
thenj = (1, 3,5, 2,4). If = “night,” then = 5,j; = 4,anda@ = a, = Nacht. 


If k is a permutation vector inverse to 7, then Program 1.126 describes a 
mapping inverse to that of Program 1.l2a. If j = (1, 3,5, 2,4), then 
k = (1, 4, 2, 5, 3). The inverse mapping can also be described in terms of 
j, as is done in Program 1.12c. The selection of the ith component of the 
permutation vector is then necessarily replaced by a scan of its components. 
Programs |.12d and 1.12e show alternative formulations of Program 1.12a. 


aA FF WwW N 


NY nA vA Fe WwW 


—; i<-0 1— > <0 
-——» jit i) i<«it+l 
op “f 
+ b: b; 3 a: a, 
kf; 4 kok, 
a «— a), [-— 5 b <5, —-> 
(a) b; <> 4G, (b) a; <> by 
— j<-0 
ies a: a; 
k <0 (d) 6, <> Gj, 
ae — 
<_— €e=> i 
b <- by, |———> ( 
aa, 
(c) @; <> Dy, 
(2) 6; <> aj, 
1 
a Set of correspondents in 


Programs (a, d, e) and set 
of arguments in Programs 
(b, c). 

b Set of arguments in Pro- 
grams (a, d, e) and set 
of correspondents in Pro- 
grams (b, c). 

j,k Mutually inverse 
permutation vectors. 


Legend 


Program 1.12 Mapping defined by a permutation vector j 
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Ordering vector 


If x is a numeric vector and & is a j-origin permutation vector such that 
the components of y = k J, x are in ascending order, then k is said to order 
x. The vector k can be determined by an ordering operation defined as 
follows: 

k<—6,/x 


implies that k is a j-origin permutation vector, and that if y = k J, *, then 
either ¥; << ¥,.1 Or ¥, = ¥;,, and k,; < k,.,. The resulting vector k& is 
unique and preserves the original relative order among equal components. 
For example, if « = (7, 3, 5, 3), then 0,/* = (2, 4, 3, 1). 

The ordering operation is extended to arbitrary vectors by treating all 
nonnumeric quantities as equal and as greater than any numeric quantity. 
For example, if @ = (7, °, 3, ,° 5, 3), then 0,/a = (3, 6, 5, 1, 2,4), and if 
b is any vector with no numerical components, then 6,/b = v'(+(b)). 

Ordering of a vector a with respect to a vector b is achieved by ordering 
the b-index of a. For example, if a = (e,a,s, t, 4, 7, t, h), and b is the 
alphabet, then m = b 4 a = (5, 1, 19, 20, °, °, 20, 8) and 0,/m = (2, 1, 8, 
3, 4, 7, 5, 6). 

The ordering operation is extended to matrices by the usual convention. 
If K = 0,//A, then each column of the matrix B = K fj, A is in ascending 
order. 


1.18 MAXIMIZATION 


In determining the maximum m over components of a numerical vector 
x, it is often necessary to determine the indices of the maximum components 
as well. The maximization operator is therefore defined so as to determine 
a logical vector v such that v/v = me. 

Maximization over the entire vector x is denoted by e[x, and is defined as 
follows: if v = e[x, then there exists a quantity m such that v/x = me and 
such that all components of »/x are strictly less than m7. The maximum is 
assumed by a single component of » if and only if +/v = |. The actual 
value of the maximum is given by the first (or any) component of v/s. 
Moreover, the j-origin indices of the maximum components are the 
components of the vector v/v’. 

More generally, the maximization operation v <— u[x will be defined so 
as to determine the maximum over the subvector u/x only, but to express 
the result » with respect to the entire vector x. More precisely, 


v<ulx<>v = u\(e[(u/x)). 


The operation may be visualized as follows—a horizontal plane punched 
at points corresponding to the zeros of uw is lowered over a plot of the 
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components of x, and the positions at which the plane first touches them 
are the positions of the unit components of v. For example, maximization 
over the negative components of x is denoted by 

v<(x < Ox 
and if * = (2, —3, 7, —5, 4, —3, 6), then (w < 0) = (0, 1, 0, 1, 0, 1, 0), 
v = (0, 1,0, 0,0, 1,0), v/v = (—3, —3), (v/x), = —3, and. v/t! = (2, 6). 
Minimization is defined analogously and is denoted by ul|¥. 

The extension of maximization and minimization to arbitrary vectors is 
the same as for the ordering operation, 1.¢., all nonnumeric quantities are 
treated as equal and as exceeding all numeric quantities. The extensions 
to matrices are denoted and defined as follows: 


Ve Ul[X<>V' = U'[X’, 
V<U|[[X=—V, = U,[X,, 
V<U[ xe Vi = Ux, 
Veul[X<V,= uf[X 
As in the case of the ordering operation, maximization in a vector @ with 


respect to order in a set b is achieved by maximizing over the b-index of a, 
Thus if 


j 


[freee 


a6kq435k82j92 
represents a hand of thirteen playing cards, and if 
c,d, h, s, 0, 6, 6, % 0,0, 0, a 6 
~ (; 3, 4, 5, 6, 7, 8, 9, 10, j, q, k, } 
1,0, 2, 1,2,3,2, 1,0, 2,0, 2, ‘ 
12,4, 11, 10, 2, 1, 3, 11, 6, 0, 9, 7, 0, 
(4, 13) ? (Buy H) = (25, 4, 37, 23, 28, 40, 29, 24, 6, 26, 9, 33, 13), 
and (e[((4, 13) ° (Bu H)))/H = (s, 3) 


is the highest ranking card in the hand. 


then B,,.H= ( 


1.19 INVERSE FUNCTIONS 


To every biunique* function f there corresponds an inverse function g 
such that e(f(x)) = x for each argument x in the domain of the function f- 
* If the function fis many-to-one, the specification of a unique inverse ¢ is achieved 


by restricting the range of g to some set of “principal” values, as is done, for example, 
for the inverse trigonometric functions. 
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It is common practice either to introduce a distinct symbolism for the 
inverse function, as for the inverse functions of logarithm (log, x) and 
exponentiation (b3), or to use a superscript —1, as in sin7!2 or f '(2). 

The first alternative doubles the number of distinct operator symbols 
required and obscures the relation between pairs of inverse functions; 
the second raises other difficulties. The solution adopted here is that of 
implicit specification; i.e., a statement is permitted to specify not only a 
variable but also any function of that variable. Functions may therefore 
appear on both sides of the specification arrow in a statement. For 
example, 

(2e) | x*<z 


specifies the variable x as the vector whose base two value is the number z. 
Certain ambiguities remain in the foregoing statement. First, the 
dimension of x is not specified. For example, if z = 12, » = (1, 1, 0, 0) is 
an admissible solution, but so are (0, 1, 1, 0,0) and (0, 0, 0, 1, 1, 0, 0). 
This could be clarified by compatibility with a specified dimension of e. 
Thus the statement 
(2e(5)) J wz 


specifies * unambiguously as (0, 1, 1, 0, 0). More generally, however, any 
previously specified auxiliary variables will be listed to the right of the 
main statement, with a semicolon serving as a separation symbol. The 
current example could therefore be written as 


v(x) — 5 


(2e) | «<-z; (x). 


The second ambiguity concerns the permissible range of the individual 
components of «. For example, the base two value of « = (5, 2) is also 
twelve. For certain functions it 1s therefore necessary to adopt some 
obvious conventions concerning the range of the result. The assumption 
implicit in the preceding paragraph is that each component of » is limited 
to the range of the residues modulo the corresponding radix. This con- 
vention will be adopted. Hence the pair of statements 


y< (7, 24, 60, 60) 
y i*x< 7278; y 
determines * unambiguously as the vector (0, 2, 1, 18). 


It is also convenient, though not essential, to use selection operations on 
the left of a statement. Thus the statement 


u/b<—a 
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is understood to respecify only the selected components of b and to leave 
all others unchanged. It is therefore equivalent to the statement 


b <— \ii/b, u, a’. 
Similarly, 
u/b <- ula 
is equivalent to 
b < /b, u, al. 


1.20 LEVELS OF STRUCTURE 


Vectors and matrices are arrays which exhibit one level and two levels of 
structure, respectively. Although in certain fields, such as tensor analysis, 
it is convenient to define more general arrays whose rank specifies the 
number of levels of structure (i.c., zero for a scalar, one for a vector of 
scalars, two for a vector of vectors (matrix), three for a vector of matrices, 
ctc.), the notation will here be limited to the two levels provided by the 
matrix.* The present section will, however, indicate methods for removing 
this limitation. 

The only essential particularization to two levels occurs in the provision 
of single and double symbols (e.g., ‘‘/”” and “//”, “|. and “|, ’) for row 
and column operations, respectively, and in the use of superscripts and 
subscripts for denoting rows and columns, respectively. In applications 
requiring multiple levels, the former can be generalized by adjoining to the 
single symbol an index which specifies the coordinate (e.g., “/,” and “/,”,, 
for row and for column compression, and, in general, “‘/,”.) The latter can 
be generalized by using a vector index subscript possessing one component 
index for cach coordinate. 

The generalized notation can be made compatible with the present 
notation for vectors and matrices by adopting the name tensor and a 
symbol class (such as capital italics) for the general array of arbitrary rank. 


1.21 SUBROUTINES 


Detail can be subordinated in a more general manner by the use of 
subroutines. The name of one program appearing as a single statement in 
a second program implies execution of the named program at that point; 
the named program js called a subroutine of the second program. If, for 
example, “Cos” is the name of a program which specifies z as the cosine of 


* Further levels can, of course, be handled by considering a family of matrices 1M, 
°2M,..., ™M, or families of families ;“M, 


’ 
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the angle between the vectors ¥ and y, then Program 1.13a uses the pro- 
gram “Cos” as a subroutine to determine r as the cosine of the angle 


between the vectors p and q. 


x<p Cos (Pp, q) 4 r= Cos (p, ) | 


(a) (6) (c) 


Program 1.13. Modes of subroutine reference 


It is sometimes convenient to include the names of the arguments or 
results or both in the name of the subroutine as dummy variables. Thus if 
“Cos («*, y)” is the name of a subroutine which determines z as the cosine 
of the angle between x and ¥y, then Program 1.135 uses Cos (4%, y) as a 
subroutine to determine r as the cosine of the angle between p and q. 
Similarly, the program ‘‘z <— Cos («, y)’”’ can be used as in Program 1.13c 
to produce the same result. 


1.22 FILES 


Many devices used for the storage of information impose certain restric- 
tions upon its insertion or withdrawal. The items recorded on a magnetic 
tape, for example, may be read from the tape much more quickly in the 
order in which they appear physically on the tape than in some other 
prescribed order. 

Certain storage devices are also self-indexing in the sense that the item 
selected in the next read from the device will be determined by the current 
state or position of the device. The next item read from a magnetic tape, 
for example, is determined by the position in which the tape was left by the 
last preceding read operation. 

To allow the convenient description of algorithms constrained by the 
characteristics of storage devices, the following notation will be adopted. 
A file is a representation of a vector x arranged as follows: 


Pi, *y, Po, %, a Mies Pree 619s Pare) cee ee Pup): 


The null elements denote the “unused” portion of the file not employed in 
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representing #. Each partition p, determines a position (position /) in the 
file. Ifa file P is in position /, then a forward read, denoted by 


x, p< WD, 


specifics x by the component #,, the auxiliary variable p by the succeeding 
partition p,,,, and stops the file in the position j + 1. 

The position of a file @ will be denoted by 7(). Thus the statement 
j <7) specifies j as the position of &, whereas m() <j positions the 
file to 7. In particular, 7(@) <1 denotes the rewinding of the file and 
m(@) <— v denotes winding, 1.e., positioning to the extreme end of the file. 
Any file for which the general positioning operation m(@)<—/ is to be 
avoided as impossible or inefficient is called a serial or serial-access file. 

Each terminal partition (that is, p; and p,;,)) assumes a single fixed 
value denoted by 2. Each nonterminal partition p; may assume one of 
several values denoted by A,,Aq,....A,q), the partitions with larger 
indices normally demarking larger subgroups of components within the 
file. Thus if » were the row list of a matrix, the last component might be 
followed by the partition A;, the last component of each of the preceding 
rows byA,, and the remaining components by A,. The auxiliary variable p 
specified by the partition symbol during the read of a file is normally used 
to control a subsequent branch. 

A file may be produced by a sequence of forward record statements: 


yD <—#,, p for ie UG(x)), 


where p is the partition symbol recorded after the component «,. As in 
reading, each forward record operation increments the position of the file 
by one. A file which is only recorded during a process is called an output 
file of the process; a file which ts only read is called an input file. 

Different files occurring in a process will be distinguished by righthand 
subscripts and superscripts, the latter being usually employed to denote 
major classes of files, such as input and output. 


Example 1.3, A set of a input files ®,4, /¢ u'(m), each terminated bya partition 
As, is to be copied to a single output file P," as follows. Successive items (com- 
ponents) are chosen in turn from files 1, ®,’,...,,,', Py', ®,1,..., always 
omitting from the sequence any exhausted file. A partition A, is to be recorded 
with the last item recorded on @,°, and all files are to be rewound. The process is 
described by Program 1.14. 


Program 1.14. Step 8 cycles & through the values | to #1, and step 9 allows the 
read on step 10 to occur only if u,, = 0. The logical vector t is of dimension mm 
and designates the set of exhausted files. Its Ath component is set to unity by 
step I1 when file & is exhausted, as indicated by the occurrence of the partition A. 
Each read is normally followed by step 13, which records on the output file the 
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1 — 
Input files for i € ¢(zm). 
2 Each has terminal 
3 partition Ay. 
‘ Output file. 
File ®,1 is exhausted if 
5 and only if wu; = 1, 
6 Item to be recorded. 
7 
L 
: gece) egend 
9 uz, 3 1 
10 b, p <- ®;" 
11 uy, < (p = Ag) 
12 use 
13 0? s= b, Ay 
14 0? <b, dy 


Program 1.14 Program for Example 1.3 


item read. However, when the last file becomes exhausted, step 14 is executed 
instead to record the last item, together with the final partition A). 

Steps 1-6 initialize the parameters u and & and rewind all files. After the last 
item is recorded by step 14, the file rewinds are repeated before the final termina- 
tion on step 7. 


It is sometimes convenient to suppress explicit reference to the partition 
symbol read from a file by using a statement of the form 


etl aie OPES 


where the indicated branches depend on the value of the partition p,,, 
which terminates the read. Thus the left or the right branch is taken 
according to whether p,,, =A, or Pj,, = Ay. Certain files (such as the 
IBM 7090 tape files) permit only such “immediate”’ branching and do not 
permit the partition symbol to be stored for use in later operations, as was 
done in Program 1.14. 

In recording, the lowest level partition A, may be elided. Thus statement 
13 of Program 1.14 may be written as 


O? <— b. 
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A file may be read or recorded backward as well as forward. A backward 
read is denoted by 


x, p< i, 


and if @ is initially in position / + [, then « = x, p = p,, and the final 
position becomes j/. Backward recording is defined analogously. The 
zero prescript may be omitted from the symbol , for both forward 
reading and recording. 

The conventions used for matrices can be applied in an obvious way to 
an array of files @,’. For example, the statement 


7((D') <—e€ 
denotes the rewinding of the row of files ®;', jeU\Q(@)); the statement 
m(@,) <€ 


denotes the rewinding of the column of files @,', ie U(u()); and the 
statement 
u/(b’ <— u/x, u/p 


denotes the recording of the vector component #, on file @,’ together with 
partition p, for all j such that uw, = 1. 

As for vectors and matrices, /-origin indexing may be used and will 
apply to the indexing of the file positions and the partition vector A as well 
as to the array indices. However, the prescripts (denoting direction of 
read and record) are independent of index origin. O-origin indexing is 
used in the following example. 


Example 1.4. Files ®,° and ?,° contain the vectors x and ¥, respectively, cach 
of dimension n, In the first phase, the components are to be merged in the order 
Xo, Vo. ty Vp.- ++ 5 %y—1s ¥y. p, and the first » components of the resulting vector 
are to be recorded on file o', and the last ” on file P,!. In other words, the 
vectors x! = a@”/z, and y! = w"/z are to be recorded on ®,! and ,}, respectively, 
where z = \x,u,y., and u = (0,1,0,1,...,0, 1). In the next phase, the roles 
of input and output files are reversed, and the same process is performed on xt 
and y!, that is, #2 = a"/(.x!, ua, y! ), and y? = w'"/( x1, ty!) are recorded on 
files ®)° and @,°, respectively. The process is to be continued through 1 phases. 


Program 1.15. The program for Example 1.4 begins with the rewind of the 
entire 2 x 2 array of files. To obviate further rewinding, the second (and cach 
subsequent even-numbered) execution is performed by reading and recording all 
files in the backward direction. Step 6 performs the essential read and record 
operation under control of the logical vector #, whose components tly, ty, tty 
determine, respectively, the subscript of the file to be read, the subscript of the 
file to be recorded, and the direction of read and record. The file superscripts 
(determining which classes serve as input and output in the current repetition) are 
also determined by w,, the input being uw, and the output @,. The loop 6-8 copies 
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0-origin indexing | 


1 n(®) — E(2 x 2) P File array of dimension 
2 x 2;original input 0°; 
2 u < e(3) original output ®!. 
3 m<«~m-—1 u Control vector. 
4 u<u wu, | Column index of input 
file. 
k nee, a, | Column index of output 
6 nie een file. 
7 Uy < ty tu, | Row index of current 
»! input file, and direction 
8 jt — A<~k—\1 of read and record. 
9 ut, n Number of items per 
file. 
10 Uy, Up : 
m Required number of 
merges. 
Legend 


Program 1.15 Program for Example 1.4 


n items, alternating the input files through the negation of uy on step 7. When 
the loop terminates, uy is negated to interchange the outputs, and the loop is 
repeated unless t4; = tt). Equality occurs and causes a branch to step 3 if and 
only if all 27 items of the current phase have already been copied. 

Step 3 decrements 7 and is followed by the negation of # on step 4. The com- 
ponent u, must, of course, be negated to reverse direction, but the need to negate 
uy and t, is not so evident. It arises because the copying order was prescribed for 
the forward direction, beginning always with the operation 


Dig. v 
oP — 9Po”. 


An equivalent backward copy must therefore begin with the operation 
ate ee 1," 

Not all computer files have the very general capabilities indicated by the 
present notation. Some files, for example, can be read and recorded in the 
forward direction only and, except for rewind, cannot be positioned 
directly. Positioning to an arbitrary position & must then be performed by 
a rewind and a succession of (A — |) subsequent reads. In some files, re- 
cording can be performed in the forward direction only, and the positions 
are defined only by the recorded data. Consequently, recording in posi- 
tion & makes unreliable the data in all subsequent positions, and recording 
must always proceed through all successive positions until terminated. 
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1.23 ORDERED TREES 


Directed graphs 


For many processes it is convenient to use a structured operand with the 
treelike structure suggested by Fig. 1.16. It is helpful to begin with a more 


21(mo) 
3 
~(m2) 


Figure 1.16 A general triply rooted tree with A(T) = 16, v(T) = (3, 3, 4, 3, 2). 
oT) = 5, u(T) = (3, 7, 8, 5, 3), and «(T) = 26 
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general structure (such as Fig. 1.17) in which a unidirectional association 
may be specified between any pair of its components. 

A directed graph comprises a vector n and an arbitrary set of unilateral 
associations specified between pairs of its components. The vector n is 
called a node vector and its components are also called nodes. The associa- 
tions are conveniently specified by a (logical) connection matrix U of 
dimension 1(n) x »(n) with the following convention: there is an associa- 
tion, called a branch, from node / to node / if and only if U,’ = I. 

A directed graph admits of a simple graphical interpretation, as 
illustrated by Fig. 1.17. The nodes might, for example, represent places, 
and the lines, connecting streets. A two-way street is then represented by 
a pair of oppositely directed lines, as shown between nodes 3 and 4. 

If k is any mapping vector such that 


UF =1 ~~ fori = 2,3,..., 2(k), 


then the vector p = k{n is called a path vector of the graph (n, U). The 
dimension of a path vector is also called its /ength, Nodes k, and k, are 
called the initial and final nodes, respectively; both are also called 
terminal nodes. If j is any infix of k, then g =j{n is also a path. It is 
called a subpath of p and is said to be contained in p. If v(q) < r(p), then 
q is a proper subpath of p. If k, = k, and p = kn is a path of a length 
exceeding one, p is called a circuit. For example, if k = (6, 1, 7, 7, 2, 6, 
1,5), then p = (tg, 1,, ;, M7, My, Ny, M,, M5) is a path vector of the graph 
of Fig. 1.17, which contains the proper subpaths (nz, mg, mg), (My, 1, 
11,, My, Mg, M,), and (t,,,), the last two of which are circuits. Node / 
is said to be reachable from node / if there exists a path from node / to 
node j. 


Ordered trees 


A graph (such as Fig. 1.16) which contains no circuits and which has 
at most one branch entering each node is called a tree, Since each node 
is entered by at most one branch, a path existing between any two nodes 
in a tree is unique, and the length of path is also unique. Moreover, 
if any two paths have the same final node, one is a subpath of the 
other. 

Since a tree contains no circuits, the length of path in a finite tree is 
bounded. There therefore exist maximal paths which are proper subpaths 
of no longer paths. The initial and final nodes of a maximal path are 
called a root and /eaf of the tree, respectively. A root is said to lie on the 
first level of the tree, and, in general, a node which lies at the end of a path 
of length / from a root, les in the /th /evel of the tree. 
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0000 1 0 1 
00000 1 0 
000 f 1 ft 0 
U=7 0 0 1 00 0 0 
00000 1 f 
1000000 
010000 ! 


Figure 1.17. A graphical representation of the directed graph (n, U). 


A tree which contains # roots is said to be w-fuply rooted. The sets of 
nodes reachable from each of the several roots are disjoint, for if any 
node is reachable by paths from each of two disjoint roots, one is a proper 
subpath of the other and is therefore not maximal. Similarly, any node 
of a tree defines a subtree of which it is the root, consisting of itself and 
all nodes reachable from it, with the same associations as the parent 
tree. 

If for each level j, a simple ordering is assigned to each of the disjoint 
sets of nodes reachable from each node of the preceding level, and if the 
roots are also simply ordered, the tree is said to be ordered. Attention will 
henceforth be restricted to ordered trees, which will be denoted by upper- 
case boldface roman characters. The height of a tree T is defined as the 
length of the longest path in T and is denoted by +(T). The number of 
nodes on level jis called the nioment of level j and is denoted by u(T). The 
vector p(T) is called the moment vector. The total number of nodes in T is 
called the moment of T and is denoted by “(T). Clearly, »(u(T)) = »(T), 
and +/u(T) = «(T) = »(n). The number of roots is equal to w,(T), and 
the number of leaves will be denoted by A(T). 

The number of branches leaving a node is called its branching ratio or 
degree, and the maximum degree occurring in a tree T is denoted by 0(T). 
The dispersion vector of a tree T is denoted by v(T) and is defined as 
follows: v,(T) = y,(T), and for j = 2,3,....2(T), v,(T) is equal to the 
maximum over the branching ratios of the nodes on level j — |. For the 
tree of Fig. 1.16, w(T) = (3, 3, 4, 3, 2). The number of roots possessed by 
a tree T (that ts, v,(T)) is called its dispersion. A tree possessing unity 
dispersion is called rooted or singular. 
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Each node n, of a graph (and hence of a tree) may be identified by its 
index 7. Since a tree admits of more convenient index vectors, the under- 
lying index 7 will henceforth be referred to as the graph index. 

In an ordered tree, any path of length k from a root can be uniquely 
specified by an index vector i of dimension k, where 4, specifies the partic- 
ular root, and the remaining components specify the (unique) path as 
follows: the path node on level / is the i,th element of the set of nodes on 
level 7 reachable from the path node on level j-!. The node at the end of 
the path can therefore be designated uniquely by the index vector ?. The 
degree of node i will be denoted by d(é, T). The index vectors are shown 
to the left of each node in Fig. 1.16. 

The path from a root whose terminal node is i will be denoted by T’. 
In Fig. 1.16, for example, T’ = (ng, ng, M3, Moy) if i = (2,2, 2,3). A 
vector @ is said to be an index of T if it is the index of some node in T. 

The subtree of T rooted in node 7 will be denoted by T;. Thus in Fig. 
(1, 3,3). A path in T, is denoted by (T;)/. For example, if G is an 
ascending gencalogical tree* with the sword and distaff sides denoted by 
the indices | and 2, respectively, then any individual « and the nearest 
(n — 1) paternal male ancestors are represented by the path vector (G,)&, 
where # is the index of x in G. 


Example 1.5. Determine the index é such that the path T? is equal to a given 
argument * and is the ‘“‘first’’ such path in T; that is, the function 


(a /¥(T)) i 
is a Minimum. 


Program 1.18. The index vector i specifies the path currently under test. Its 
last component is incremented repeatedly by step 7 until the loop 6-8 is terminated. 
If the path Ti agrees with the corresponding prefix of the argument x, termination 
occurs through the branch to step 9, which tests for completion before step 10 
augments é by a final zero component. Step 5 then respecifies das the degree of 
the penultimate node of the set of d paths next to be tested by the loop. Termina- 
tion by a branch from step 6 to step 2 occurs if all d possible paths are exhausted 
without finding agreement on step 8. In this event, retraction by one level occurs 
on step 2, and d is again respecified. If »(i) = 1, the paths to be searched com- 
prise the roots of the tree and d must therefore be specified as the number of 
roots. This is achieved by executing step 3 and skipping step 5. Retraction toa 
vector i of dimension zero occurs only if all roots have been exhausted, and final 
termination from step 4 indicates that the tree possesses no path equal to the 
argument %, 


* Although such a genealogical tree is not necessarily a tree in the mathematical 
sense, it will be assumed so for present purposes. 
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Program 1.18 Determination of é such that T? = x 


If d is a vector of dimension »(1t) such that d, is the degree of node n, of 
a tree T, then d is called the degree vector associated with n. In Fig. 1.16, 


for example, 
d = (3, 2,4, 0,0, 0, 2,..., 1,0, 0). 


Moreover, if is itself the alphabet (that is, m = (a, b,c,...,z)), then 
the vector n’ of Table 1.19@ is a permutation of m, and d’ is the associated 
degree vector. Table 1.19 shows another such pair, n” and d”. 

The degree vector provides certain useful information most directly. 
For example, sinee each leaf is of degree zero, A(T) = +/(d = 0). More- 
over, the number of roots is equal to the number of nodes less the total of 
the degrees, that is, w,(T) = 0d) — +/d, and the maximum degree 
occurring in T is given by 0(T) = ((e[d)/d),. Finally, the degree vector 
and the node vector together can, in certain permutations (those of Table 
1.19), provide a complete and compact description of the tree. 


Right and left list matrices 


If each one of the u(T) index vectors i of a tree T is listed together with 
its associated node (T"),;,, the list determines the tree completely. Since 
the index vectors are, in general, of different dimensions, it is convenient 
to append null components* to extend eaeh to the common maximum 
dimension #7). They may then be combined in an index matrix of 


* In the l-origin indexing system used here it would be possible to use the numeric 
zero to represent the null. In 0-origin indexing, however, zeros occur as components of 
index vectors and must be distinguishable from the nulls used. 
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1 n eo co co 1 3 
0 k o o o 2 1 
3 h o ce o 2 2 
0 j o o o 3 1 
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0 f o o I I t 
0 d o o | 1 2 
0 r o o 1 | 3 
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0 oO o c 2 2 1 
3 m o o 2 2 2 
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0 P eo | 3 1 1 
0 q >» | 3 1 2 
0 u « 2 2 2 1 
2 S o 2 2 2 2 
1 x o 2 2 2 3 
0 t 22 2 2 +1 
0 Ww 222 2 2 
0 y 22 2 3 =) 


Full right list matrix ]T 
()) 


Table 1.19 Full list matrices of the tree of Fig. 1.16 


dimension “(T) x »(T), which, together with the associated node vector, 
completely describes the tree, If, for example, the node vector n is the 
alphabet, the tree of Fig. 1.16 is described by the node vector n’ and index 
matrix I’ of Table 1.19a or, alternatively, by 2” and I” of Table 1.194. 
Because of the utility of the degree vector, it will be annexed to the array 
of node vector and index matrix, as shown in Table 1.19a@, to form a full 
list matrix of the tree. The degree vector and node vector together will be 
called a list matrix. As remarked, the list matrix can, in certain permuta- 
tions, alone describe the tree. 
Formally, the full list matrix M of a tree T is defined as follows: «°?/M 
is an index matrix of the tree, M, is the associated degree vector, and M, 
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is the associated node vector. Thus for each k e U(u(T)), M," = 64, T), 
and M,* = (T'),.;, where i is the nonnull portion of @/M", that is, i = 
((a?/M*) =4 ce)/a?/M"). The corresponding list matrix is «?/M. 

Since a full list matrix provides a complete description of a tree regard- 
less of the order in which the nodes occur in the list, any column permu- 
tation MP (that is, any reordering among the rows) is also a full list matrix. 
Two particular arrangements of the full list matrix are of prime interest 
because each possesses the following properties: (1) the nodes are grouped 
in useful ways, and (2) the list matrix (i.e., the degree vector and node 
vector) alone describes the tree without reference to the associated index 
matrix. They are called the full /eff list matrix and full right list matrix and 
are denoted by [T and JT, respectively. Table 1.19 shows the full left and 
full right lists of the tree of Fig. 1.16. 

The left list index matrix I is left justified,* that is, the null clements are 
appended at the right of each index. The rows I’ are arranged in increasing 
order on their values as decimal (or rather (6(T) + 1)-ary) fractions with 
the radix point at the left and the nulls replaced by zeros. More precisely, 
the rows are arranged in increasing order on the function G(a)e) | 
(a « I’), where a = (9, 1, 2,..., O(T)).t 

The right list matrix is right justified and is ordered on the same function, 
namely (v(a)e) (4 4) V’). The rows are therefore ordered on their values 
as integers, i.c., with the decimal point at the right. From the example of 
Table 1.194 it is clear that the right list groups the nodes by levels, Le., 
level 7 is represented by the infix (| «")//QT), where k = w(T), and 
i= +/a’/u(T). In Table 1.196, for example, w(T) = (3. 7, 8.5, 3), and 
if7 = 3, then kA = 8, / = 10, and level / is represented by rows/ + 1 = I] 
to?+k=18. The right list is therefore useful in executing processes 
(such as the pth degree selection sort) which require a scan of successive 
levels of the tree. 

The left list groups the nodes by subtrees, i.c.. any node 7 is followed 
immediately by the remaining nodes of its subtree T;. Formally, if J = 
e/[T, and if i = I" 4 °e)/I", then the tree T; is represented by the infix 
(A — 1) La)//[T. In Fig. 1.19a, for example, if & = 16, then i = 
(2, 2, 2), “(T;) = 7, and T; is represented by rows 16 to 22 of [T. The left 
list is therefore useful in processes (such as the construction of a Huffman 
code and the evaluation of a compound statement) which require a 
treatment of successive subtrees. 

The row index of a node in a right (left) list matrix is a graph index of 
the node and will be called the right (left) list index. 

* The term /ef? list and the notation [T are both intended to suggest left justification. 


+ These statements hold only for J-origin indexing. In O-origin indexing, @ = 
(-,0,1,...,0(T) —1). 
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Well formation 


A two-column matrix which forms the right list of some tree is said to 
be a well formed right list. Since the ordering of the nodes in a right list of 
a given tree is unique, the right list of a given tree is unique. Conversely, 
any well formed right list specifies a unique tree according to the algorithm 
of Program 1.20. 

Identical remarks apply to the left list, except that Program 1.20 is 
replaced by Program 1.21. Moreover, the necessary and sufficient 
conditions for the well formation of a left list are identical with those fora 
right list and are derived by virtually identical arguments. The case will 
be stated for the right list only. 

If R is a well formed right list representing a tree T, then the dispersion 
(i.e., the number of roots) v,(T) = »(R,) — (+/R,) must be strictly 
positive. Moreover, if S = @//R is any suffix of R, then S is a right list of 
the tree obtained by deleting from T the first / nodes of the original list. 
For, such level-by-level deletion always leaves a legitimate tree with the 
degrees of the remaining nodes unchanged. Consequently, the number of 
roots determined by every suffix of R, must also be strictly positive. In 
other words, all components of the suffix dispersion vector s defined by 


s, = ae YR,) —(+/a YR), fe UO(R)) 


must be strictly positive. The condition is also sufficient. 

Sufficiency is easily established by induction on the column dimension 
of R. The condition is clearly sufficient for v(R,) = 1. Assume it sufficient 
for dimension #(R,) — |. If s, the suffix dispersion vector of R, is strictly 
positive, then @!/s, the suffix dispersion vector of @!//R, is also positive, 
and by hypothesis @'//R represents a tree G possessing s, roots. Moreover, 


0<s,=s, + — R,') 


implies that s, = R,', and the number of roots possessed by G therefore 
fulfills the number of branches required by the added node R,'. A 
legitimate tree corresponding to R can therefore be formed by joining the 
last R,' roots of G to the node R,!. 

Tests for well formation can therefore be incorporated in any algo- 
rithm defined on a right or left list matrix M by computing the components 
of the suffix dispersion vector s. The recursion s,, = s; + | — Mj~‘is 
convenient in a backward scan of M, and the equivalent recursion 
S$; = 8,4 — 1+ M'~' serves for a forward scan. The starting condition 
for a forward scan is s; = »(M,) — (+/M,), and for a backward scan is 
s, = | — M,". Sinee the criteria of well formation are identical for right 
and left lists, a matrix may be characterized simply as well or ill formed. 
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The purpose served by the degree vector d in the description of a tree is 
sometimes served instead [e.g., Burks et al. (1954)] by the vector g = 
e — d. It is somewhat more convenient in the analysis of weil formation, 
since the expression for the suffix dispersion vector then simplifies to 


s..=(+/e/g, or s=(h+™)j/g. 


The index matrix as a function of the degree vector 


The complete determination of the tree corresponding to a given list 
matrix M is best described as the determination of the associated index 
matrix I. For both left and right lists this can be achieved by a single 
forward scan of the rows of M and of I. 

For a right list R it is first necessary to determine r, the number of roots. 
The first r components of R are then the roots of the tree in order, the 
next R,’ components of R are the second-level nodes reachable from the 
first root, and so forth. Programs 1.20 and 1.21 describe the processes for a 
right list and a left list, respectively. 


Program 1.20. In each execution of the main loop 13-16, the ith row of the 
right list R is examined to determine the index vector of each node on the succeed- 
ing level which is directly reachable from it. The number of such nodes is con- 
trolled by the parameter ¢, initialized to the degree of the ith node by step 12. 
The (right list) index of the nodes reachable from node / is determined by j, which 
is incremented on step 14 as the index vector of each node is determined. The 
index vectors of the successive nodes reachable from node i have the final com- 
ponents I, 2, 3,..., and cach must be prefixed by the index vector of node /. 
This assignment is effected by the vector v, which is initialized by the index vector 
of node / rotated left by one (step 11), and which Is incremented by step {5 before 
each assignment occurring on step 16. At the outset, v is set to zero and dis set 
to the number of roots as determined by step 4. 

Since jis, at step 10, equal to the current number of roots r augmented by the 
cumulative degrees of the first / — 1 nodes, then r = j — i + 1 and the exit on 
step 10 therefore occurs always and only in the event of ill formation. Alterna- 
tively, the test can be viewed as an assurance that cach row of the matrix I is 
specified before it is itself used in specification. 

When step 5 is first reached, the index matrix I is complete but is expressed in 
1-origin indexing with zeros representing the null elements. Steps S—7 translate 
the matrix to the origin ¢ and mask in the necessary null elements. 


Program 1.21. The index vectors I’ are determined in order under contro] of 
the parameter j. The loop 5. 18 traces a continuous path through the tree, deter- 
mining the index of each successive node of the path by rotating the index of the 
preceding node (step 17) and adding one to the last component (step 13), and 
maintaining in the connection vector ¢ a record ¢;,, of the index / of the successor 
of node / in the path traced. The path is interrupted by the occurrence of a leaf 
(that is, L,’ = O on step 18), and the degree vector L, is then scanned by the loop 
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Program 1.20 Determination of the index matrix I associated 
with a right list matrix R 


(19-20) to determine the index / of the last preceding node whose branches remain 
incompleted, Steps 22.-23 then respecify v as the index vector of the node follow- 
ing node / in the path last traced, and step 21 decrements the component L,‘ of 
the degree vector. The branch from step 19 to step 22 occurs at the completion 
of cach rooted subtree. The test for well formation (step 12) is the same as that 
applied to the right list in Program 1.20, except that the notation for the relevant 
parameters differs. The concluding operations (6-9) include left justification on 
step 7. 


Tree, path, and level compression 


The tree compression 
P<—U/T 


specifics a tree P obtained from T by suppressing those nodes corre- 
sponding to zeros of the logical tree U, and reconnecting so that for every 
pair of nodes x, y of P, x belongs to the subtree of P rooted in ¥ if and only 
if x belongs to the subtree of T rooted in y. If, for example, T is the tree of 
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Program 1.21 Determination of the index matrix I associated 


with a left list matrix L 


Fig. 1.16 with n as the alphabet, and U is the tree of Fig. 1.22a, then P 
is the tree of Fig. 1.226. The new indices are shown to the left of each node 


of P. The set of nodes 221, 222,..., 226, are all on the same level of P 


although they have been shown with branches of different lengths to 
permit easy identification with the nodes of the original tree T. 
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U U/T 
(a) (b) 
Figure 1.22 Compression of tree T of Fig. 1.16 (with m = alphabet) 


The compress operation is best executed on the left list because of the 
grouping by subtrees. Program 1.23 gives a suitable algorithm which also 
serves as a formal definition of the compress operation. 


Program 1.23. The vector u is specified as the node vector of the left list of the 
controlling logical tree U and controls the subsequent process. Step 4 determincs 
jas the index of the first zero component of ut, Steps 6 and 7then delete the corre- 
sponding nodes of u and of the left list of T, but only after step 5 has determined 
as the change in degree which this deletion will occasion to the root of the smallest 
subtree containing the deleted node. Steps 9~11 perform a backward scan of the 
degree vector to determine / as the index of the root of the subtree, and step 12 
effects the requisite change in its degree. The exit on step 9 occurs only if the node 
deleted is a root of the original tree, in which event no change is produced in the 
degree of any other node. 


Two further compress operations controlled by logical vectors are 
defined as follows. Path compression is denoted by 


P< u/T. 
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Program 1.23 Determination of the left list L = a2/[((U/T) 


P is obtained from T by suppressing every node on level jif uw; = 0, and 
reconnecting as in tree compression. Leve/ compression is denoted by 


P< 4u//T, 


and P is obtained from T by deleting each rooted subtree T, for which 
u, = 0. 

Path compression by a unit vector €’ produces a tree of height one. 
Such a tree is, in effect, a vector and will be treated as one. 

Two related special logical trees are defined: the path tree “E such that 
u/“E = 0 and u/“E is the full tree E whose nodes are all unity, and the 
level tree ,,E such that u//,E = 0, and u//,E = E. 


Extension of other operations to trees 


Two trees are compatible if they have the same structure. Elementary 
binary operations are extended node by node to compatible trees. For 
example, 

Z<-Xx Y 


implies that node i of Z is the product of nodei of X and node i of Y for all 
i, Similarly, 
M<6.,,T 
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specifics M as a tree (of the same structure as T) such that node 7 of M is 
the j-origin b-index of node 7 of T. 
The mapping operation is extended to trees so as to permute the rooted 
subtrees of a tree. Formally 
P<m{f,T 


implics that u,(P) = (7m), that P; is a single null character if m, ¢U(u,(T)), 
and otherwise P, = T,,,, where j-origin indexing is used for T. 
Permutation of the subtrees rooted in node i of T can be effected as 
follows: 
at/T, <- m | (a'/T)). 


The notation ¢//T will denote the application of the binary operator or 
relation © to the nodes of T in right list order (1.c., down successive levels) 
and ©/T will denote the same application in left list order (i.c., across 
paths). If the operator is symmetric (Le., its operands commute), then 
o//T = c/T. 

Maximization (U[T) and minimization (U|T) are extended to trees in 
the obvious way. 

The operations #/u, m/u, o/a, and 7/a are cach extended in two ways: 
across paths and down levels. Examples of each appear in Fig. 1.24. 
Operations extending down levels are denoted by double virgules and 
represent an application of the corresponding vector operation to each 
level of the tree considered as a vector. For example, the statement 


V~aoal//A 


implies that each level of V is the forward set selection of the corresponding 

level of A, that is. €//V = o/e’/A. Operations extending across paths are 

denoted by single virgules and are defined in terms of subtrees. Thus 
V<z/U 


implics that V is obtained from the logical tree U by setting to zero all 
nodes of any subtree rooted in a zero node, and 


Va of{U 


implies that V is obtained from U by setting to zero every node whose 
subtree contains a zero node, The definitions of ¢/U and 7/U are analo- 
gous. 


Homogeneous trees 


If, for all 7, every node on level / of a tree T is either of degree zero or of 
degree v,_,(1), then the tree T is said to be uniform. If all leaves of a 
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uniform tree T lie in the same level (necessarily the top), then the tree is 
said to be homogeneous. The structure of a homogeneous tree is completely 
characterized by its dispersion vector v(T). All maximal paths in a homo- 
geneous tree are clearly of the same length, namely o(T) = »(v(T)). Figure 
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1.25 shows a homogeneous tree and its associated dispersion vector. 
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Figure 1.24 Set selection and maximum prefix and suffix operations 
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A tree T for which v(T) = me is called an mi-way tree, and a tree for 
which v,(T) = 1 and a@!/v(T) = me is called a singular m-way tree. 


v(H) = (2,3, 2) 
u(H) = (2,6, 12) 


Figure 1.25 Homogen- 
eous tree H and disper- 
sion and moment vectors 


The jth component of the moment vector of 
a homogeneous tree is clearly equal to the prod- 
uct of the first 7 components of the dispersion 
vector, that is, w(T) = (C1 + N, v(T). The dis- 
persion vector is, in turn, uniquely determined 
by the moment vector. The total number of 
nodes is given by u(T) = +/u(T), and it can also 
be shown that “(T) = y | y, where y is the 
dispersion vector in reverse order. 

Tree compression of a homogeneous tree H 
{that is, U/H) does not generally produce a 
homogeneous tree, and, in fact, any tree P of 
arbitrary structure can be represented by a pair 
of homogeneous trees U and H such that P = 
U/H. On the other hand, both path and level 
compression of homogeneous trees produce 
homogeneous trees. Moreover, if P = u/H, then 
v(P) = u/v(H), and if P = u//H, then v(P) = 
v(H) — (+/i)a!. 

Since the structure of a homogeneous tree is 
completely specified by its dispersion vector k, 
the structure of the special logical trees can be 
specified in the forms E(k), “E(k), and ,,E(%). 

In a homogeneous tree, the right list or left 
list index of a node can be determined as an 
explicit function of its index vector. Conversely, 
the index vector 7 can be determined directly 
from the corresponding left list index, to be 
denoted by /(é), or from the right list index r(é). 
In developing the relations between indices it 
will be convenient to use Q-origin indexing 
throughout. 

The right list index is given by 


ri) = f(@) + g@,. 
where f@ = +/e"?~'/e(T) 


is the number of nodes in the first #(#) — | levels, and 


ei) = (a /o(T)) | i 


is the rank of node i in the »(i)th level. For example, if i = (1, 0, 1) in the 
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tree of Fig. 1.25, then w(H) = (2.6, 12), fli) =*/(2,6) = 8, and 
gf) = (2,3, 2) | (1,0, 1) = 7. 

Since f(i) depends only on (4), the index i may be determined from r by 
first determining +(i) as the largest value for which f(é) <r, and then 
determining # such that 

(a W(T)) Lise — fl. 

In tracing a path through a tree, the kth node of the set reachable from 
node i is the node jf = 7 (k). It is therefore useful to express r(j) asa 
function of r(i). Clearly 

IG) = FG) + (RTD) nw 1 
8) = gi) x (WT) ay FF. 
In the special case of a singular homogeneous m-way tree, 
fi) =Lt mt meee tm = (me) | eli) — 1) 

v(i)—1 

m — | 


m— 1 


Hence f(j) = 1 + m x f(i), and g(j) = m x gli) +j,_,. Recursion can 
therefore be performed simply upon the single function r(i) as follows: 


rj) =m x ri) 1 + fee1. 

The left list index i) is most conveniently expressed as a function of 
(i) and of the vector 2(i) (zero extension of i), where z = a ’(O(T))\i. 
Clearly »(z) = »(T) and 2 is the index of the “‘earliest” leaf reachable from 
node i. In Fig. 1.25, for example, 2((1, 2)) = (1, 2, 0). 

The zero extension has the obvious property that every node above the 
path T*? precedes node i in the left list, and every node below the path 
follows it. The number of nodes in the path which precede node # is 
vi) — 1, 

The number of leaves above the path T*? is v(T) . 2(i), and more 
generally, the number of (j— 1)th level nodes above it is given by 
(a?/v(T)) | (a’/z{i)). Consequently, 

+(T) 
Ki) = vi) — 1 + & (a’/w(T)) _ (a’/2(7)). 
j= 
For example, if # = (1, 0) in Fig. 1.25, then 2(7) = (1, 0, 0) and 
The foregoing result may be written alternatively as 
Ii) = oi) — 1 + w % 2(i), 


where w, = 1, and w,_; = 1 + (w, x v,(T)). In the foregoing example, 
w= (10,3, 1), and w % 2) = 10. This form is most convenient for 
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determining i as a function of /, for since wy z=/+ 1 — 2%), then 
zi) = | + |, 2,4) = (wo |) — 1) + I, ete. for all positive values 
of the quotient, and all components thereafter are zero. The dimension 
v(i) is then determined from the relation xi) = /+ 1 — w ¥ x(i). 
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EXERCISES 


Organize cach of the programs according to the method of leading decisions. 
Except where otherwise indicated, use ]-origin indexing. The conventions of 
Sec. $.1 of the Summary of Notation will be used in the statement of each 
of the exercises. 
11° Letd = (a, 2, 3,4, 5, 6,7, 8,9, 10,j,q,k),s =(c,d,h,s),u = CJ, 0, 1, 0, 1), 
v =(0,1, 1, 1,0), * = (16, 8,4, 2, 1), andy = (2, 3, 4, 5, 6). Determine 

(a) the dimensions »(d@), »(s), and v(x). 

(b) the vectorsx +¥,*% —y,*% Xx ¥,*% +y,andu 4 v, 

(c) the logical vectors u A v, u V v, (te # v), and (u = v), 

(d) the reductions +/x, x/y, A/u, and y /v. 

(e) the base two values of t and of v, that is, +/(~ x uw), and +/(* x v). 

(f) the rotated vectors 2 | d,4 Ts, and Ty. 

(g) the unit vector e'(5) in a 1-origin system, and e*(5) in a 0-origin system. 

(h) the infixes (@°(7) A w(7)) and 2 | a3(7). 
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1.2. Show that 

(a) x/u(n) =n! (Include the case n = 0.) 

(b) +/0@1) = nla + 27 — 1) + 2. 

(c) x/(k Tx) = x/x. 

(d) (kts) +(kTy) = kt (e+). 
1.3. Write detailed (i.c., component-by-component) programs for the following 
operations. Include tests for compatibility of the operands. 


(a) w<-u Art. (g) wu <al(k). 
(b)W«.Uy V. (h) w< f] ak), 
(c) B<- Ufa, (i) ¢ <-ja,u,b., 
(d) B<«-u/A. (Gj) ¢ <- fa, u, bf. 
(ce) B«-ulv//A, (k) ¢ «<- wa. 


(f) * «(x > 0)/x. 
1.4 Establish the identities 
(a) ja, u,b) = ‘ula, u,u/b.. 
(b) \a, u,b. = /ua,u, uid]. 
1.5 The classic “‘rings-e-seven” puzzle can be posed as follows: an ordered 
collection of # rings is to be placed on (removed from) a bar under the following 
constraints: 
(i) ring » may be placed on or removed at will. 
(ii) ring & may be placed on or removed only if ring (A + 1) is on and all 
succeeding rings are off. 
The state of the rings can be described by a logical vector u, with tw; = 1 if ring & 
is on, Write programs on u which describe the removal of the rings beginning 
with 
(a) uw = € [The successive values of u represent a reflected Gray code; sce 
Phister (1958).] 
(b) tw arbitrary. 
1.6 The ordered array of variables used to represent a variable 2 in some coding 
system may be considered as a vector representation of «, denoted by e(r). In the 
8421 code for decimal digits, for example, e(0) = (0, 0, 0, 0), e(1) = (0, 0, 0, 1), 
and, in general, e(~) is defined by the relation +/[w x e(#)] = x, where w = (8,4, 
2,1). For each of the following coding systems, (see Richards, pp. 183-184 for 
definitions), write a concise expression for e(): 
(a) the excess-three code for decimal digits. 
(b) any chosen two-out-of-five code. 
(c) any chosen biquinary code. 
(d) the semaphore code for the alphabet (see any good dictionary). Denote 
each code by a two-component vector e(z) © 08), Use ae, where 
a =(a,b,c,...,Z). 
1.7 Let X be a square sparse matrix represented by the logical matrix U = 
(X # 0) and either or both of the vectors r = U/X, and c = U//X. Write pro- 
grams to determine the product Y = X 3} X, using the arguments 
(a) r,c, and U, 
(b) rand U. 
(c) cand U. 
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1.8 Prove that 

(a) fr] = —| —r]J. 

(b) lla + b] +c] = la + be] for all positive integers a, 6, and c. 
1.9 Letr = E/A, and ¢ = E//A be the row list and column list, respectively, of 
the matrix A, and let r,, A,’, and c, be corresponding elements of the three 
representations of A, Determine: 

(a) fas a function of k, »(A), and ;(A), 

(b) & as a function of #, »(A), and (A). 

(c) the permutation vector A such that c = A { Yr, 
1.10 Show that 


(a) //u =‘) /t (Use De Morgan’s law for two variables and induction.) 

(b) //tu = 2\, +/t (Use induction.) 

(c) =/u = 2) +/u. 

(d) ¥/u = =/u. 

(ec) Uv = (2e) [g(U & v). 

(f) Us V=UyY. 

(g) (fu) \ (vow) = (Fe Ww) A (ve Ud). 
1.41 (a) Show that +/* = +/(u/x) + +/(u/x). (Include the case u = 0.) 
(b) What properties are required of an operator © that it satisfy the relation 
established for + in part (a)? 
1.12 Show that 

(a) XY = (u/X) . (a//¥) + (u/X) ¢ (u//Y). 

(b) uf(X 7 Y) =X 5 (u/Y). 

(c) uff(X , Y) = (u//X), Y. 

(d) (wu "v/a = (u/v)/(tt/a), 
1.13. Use the result of Exercise |.11(b) to extend the results of Exercise 1.12(a-c) 
to logical operators. 
1.14 Write programs to determine: 

(a) the value of the polynomial x at the point a, that is, to evaluate (ye) | w 

for y = a. Use no more than »(x) multiplications. 
(b) the derivative of the polynomial x, that is, the vector % such that 


fl 
(ye) | 2 = 5 (ue). x), and o(z) = (x). 
ad 


(c) the integral z of the polynomial x satisfying the boundary condition 
(ae) z=. 
the quotient q and remainder r obtained in dividing the polynomial n by 
the polynomial d, for »(d) <= »(1)}. 
(e) the value of the polynomial m at the point @ by using part (d) with d = 
(1, —a). 


dl 
(f) the value f= ((ye) 
dy 


(d 


— 


|. 1) at the point a by two applications of part (¢). 
(g) an approximate real root of the equation (ye) | x = 0, using parts (e) and 
(f) and the Newton-Raphson formula [Kunz (1957)]. 
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1.15 Let the components of the vector r be the real roots of a polynomial x. 
Write a program to 

(a) determine the symmetric functions of r. [Dickson (1939), Ch. X.] 

(b) determine x as a function of r. 

1.16 Write a program to determine the polynomial x consisting of the first 7 
terms of the exponential series | + y + yf) ones 

1.17 Write a program to determine the moduli of all roots of the polynomial 4, 
using the Gracffe method [Kunz (1957)]. Assume that operations for the logar- 
ithm and exponential functions are available as subroutines. 

1.18 List all the l-origin permutation vectors of dimension four which are self- 
inverse. 

1.19 Using 1-origin indexing, write programs to derive 

(a) the permutation R which is inverse to the permutation /. 

(b) a permutation # which transforms a given logical vector t to a prefix vector. 
1.20 A square logical matrix U such that +/U = +//U = € is sometimes called 
a permutation matrix, since premultiplication of a numerical vector « determines 
a permutation of x, Write programs to determine 

(a) the permutation matrix U corresponding to the I-origin permutation vector 
k, that is, determine U such that Uy x =k J, ¥. 

(b) the permutation & corresponding to a given permutation matrix U. 

(c) the permutation V which is inverse to the permutation U. 

1.21 Let p be the vector representation of a permutation and let ¢ be the 
standard representation in terms of disjoint cycles, including all cycles of one 
[Jacobson (1951), p. 34.) Each cycle of ¢ is enclosed in square brackets, each 
half-bracket being considered as a component of c. For example, if ¢ = ({, 1, 3, 
5,1, 2,4, ], [, 6, )), then p = (3, 4, 5, 2, 1, 6), #(e) = 12, and »(p) = 6, and, in 
general, »(¢) = »(p) + 2k where & is the number of disjoint cycles in p. The 
usual elision of cycles of one would give ¢ = ({, 1, 3, 5, J, [, 2, 4, ]), but this deter- 
mines a untque correspondent p only if the dimension of p Is otherwise specified, 
and inclusion of all cycles of one will therefore be assumed. If cach infix of 
numerical components in ¢ is preceded by a left bracket and followed by a right 
bracket, and if ¢ determines a legitimate permutation vector p, then € is said to be 
well formed, 

(a) Write a program to determine p as a function of a well formed permutation 
c. Include determination of the dimension of p. 

(b) Modify the program of part (a) to incorporate checks on the well formation 
of c. If ¢ is ill formed, the vector p is to be defined as the literal “ill 
formed.” 

(c) Modify part (b) to process a sequence of vectors c!, c®,... , cach pair being 
separated by a single null element, and the end of the sequence being 
indicated by a pair of successive null elements, i.e., to process % = 
cle (ec) e% 6 +++ 5 EF ~ (0,2), Include checks on the well formation 
of cach permutation. 

(d) Write a program to determine the parity [Jacobson (1951), p. 36] of a 
permutation vector p. 
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1.22 Write detailed programs for the following processes: 


(a) Rk «- 0,/* (i) m<-baya 
(b)y< mf,x  (j) M<«-B iy 
(c) v<« ufx (k) u< €," 


(d)V. ufX  () c+-b aa 


(e) v«-a/tt (m)ce<«-b fa 
(f) Vi«sw//U (n) ¢<-bU 4 
(g) v< a/b (0) C«-a%b 
(h) V< 7//B 


1.23 (a) Copy onto file p,? successive groups of items from the row of files @? 
in cyclic order, omitting any exhausted files, The end of each group is 
demarked by a partition A,, and the end of each file by a partition Ax. 

(b) A file which is always recorded in the forward direction and read tn the 
backward direction functions as a stack. Using file @,? as a stack, 
modify the program of part (a) so as to reverse (in the output file 7) 
the order of the items within cach group. 

1.24 The accompanying node vector m and connection matrix C together specify 

a directed graph (C;' = | indicates a branch from node / to node /) which is, in 

fact, a tree. 

n = (a,b,c, de. fig 


0000 1 1 0 
0 00 0 0 0 0 
10010 0 0 
c=]0 00 0 0 0 0 
00000 0 0 
00000 0 0 
01000 0 0 


(a) Draw one of the possible ordered trees represented by m and C, 
(b) For the tree T of part (a) show the full left list [T. 
(c) Show the full right list JT. 
1.25 Write programs which include tests on compatibility and which determine 
(a) L =[T from R = JT 
(b) S = }(u/T) from wT), JT, and u 
(c) M = [(u//T) from L = [T and u 
(d) M =[(k [, T) from L =[Tandk 
1.26 (a) Give an example of a well formed right list which demonstrates that a 
prefix of a right list is not, in gencral, well formed. 
(b) Show clearly where the argument used in establishing the well forma- 
tion of any suffix of a well formed Jist breaks down when applied to a 
prefix. 
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1.27. Give formal proofs for the facts that 

(a) a left list groups nodes by subtrees. 

(b) a right list groups nodes by levels. 

(c) £4(T) = (d) — +/d, where d is the degree vector of T. 
1.28 Write programs to determine #(T) as a function of 

(a) the left list degree vector of T. 

(b) the right list degree vector of T. 
1.29 Trace Programs 1.20 and 1.21 for the tree of Exercise 1.24. 
1.30 Show that for a homogeneous tree H, *(H) =¥ . ¥, where y = v(H), 
1.31 If H is homogencous, w(H) = (3, 2, 3, 4), andi = (1, 0, 2), determine, ina 
0-origin system 

(a) the left list index /(4). 

(b) the right list index r(4). 

(c) the index j of the node whose left list index is 27. 


1.32 (a) If K = w%n) | (en) 2 W(1)), show that K + K =nwE — 1). 
(b) If y is any permutation of x and v(x) =n, show thatx |) Kl x = 
Jy : K a J: 
1.33 Using the Euclidean algorithm, write programs to determine: 
(a) das the greatest common divisor of positive integers » and ¥. 
(b) das the g.c.d. of « and y where d, x, and y represent polynomials in = 
(e.g., (2€) 4). 
1.34 To assure uniqueness, the number of different digits (symbols) used in a 
base / number system must not exceed 4. The limitation to the particular range 
0 - a; - 4 is, however, not essential. For example, a base three system can be 
constructed using digits —I, 0, and [, for which it is convenient to adopt the 
symbols —, 0, and +, respectively. The positive numbers beginning at zero are 
then represented by the sequence 0, +, +—, +0, ++, +-—-, +—-0, +-—4, 
+0—, +00, etc. The negative numbers beginning at zero are 0, —, — +, —0, 
Sy bh +0, +—, —0+, —00, ctc. 

(a) Construct addition and multiplication tables for this number system and 
calculate the sum and the product of the numbers 0— and ——. Use the 
decimal system to check all results. 

Negative numbers are represented in this system without the attachment 
of a special sign position. Special rulcs regarding sign are therefore 
banished except that it is necessary to formulate a rule for changing the 
sign of a number, i.c., to multiply by minus one. Formulate such a 


(b 


rule. 
1.35 For any integer n, let, = 2[ya-t3 = 3 lamers = Soa. and a, = 7|qu. 
As shown by Garner (1959), the ordered array ("y, 13, +5. +7) provides a representa- 
tion of the integer # in a so-called residue number system. 
(a) Write the residue representations of the first ten nonnegative integers. 
(b) For integers # in the range 0 -° 7 -° (2 x 3 x 5 x 7) show: 
(1) that the representation is unique. 
(2) that an addition algorithm may be defined which treats the several 
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columns independently, i.e., there are no carries. (The sums must 
also lie within the specified range.) 
(c) Discuss the choice of moduli for extending the range of the representa- 
tion. 
(d) Show that the algorithm derived in part (b) is valid for all positive and 
negative integers in the range ~a/2 <n <a/2fora =2 x3 x5 x7. 
(e) Derive an algorithm for obtaining ~—a from a. 
(f) Derive an algorithm for multiplication. 
(g) The sign of the number (i.e., its relation to zero) is not displayed 
directly by this representation. Convince yourself that its determi- 
nation is nontrivial. 


1.36 Let *, y, and 2 be the positional representations of the numbers .r, ¥, and = 
respectively. Using the floor and residue operations, write programs to deter- 
mine 2 as a function of x and y, where z = 2 + y and the representation in use is 

(a) base b. 

(b) mixed base 8. 

(c) the +, —, 0 base three system (of Exercise 1.34). 

(d) the residue number system (of Exercise 1.35). 


1.37 Write programs for the multiplication z = x x y for each of the cases of 
Exercise 1.36, 


1.38 Write programs to convert in each direction between the following pairs of 
number systems: 

(a) base 6, and base by. 

(b) base b! and base 6°. 

(c) base three and the +, —, 0 base three of Exercise 1.34. 

(d) residue and base b (Exercise 1.35). 


1.39 (a) Show that the superdiagonal matrices satisfy I > "P= 4° M4, 

(b) A matrix of the form J = (21 + 44) is called a Jordan box. Write the 
expansion of the nth power of J. 

(c) Show that ¥ > Y=X,¢ Y14 Xo V2 +--+ 4+ Xyyy 9 VM. 

(d) Determine an explicit solution to the set of linear equations A . x=Yy, 
where u/x = a@ and v/y = b are known and where +/tu + +/v = 
v(A) = 4(A). State the conditions for the existence of a unique solu- 
tion. 


1.40 Any nonsingular matrix A can be reduced to the identity f by a sequence of 
row operations of the form A* «- A’? + yA’, or A’ <> A’. The process which 
accomplishes this (using row operations only) by reducing successive column 
vectors to the successive unit vectors is called Jordan or compiete elimination. 
If the same sequence of row operations is executed upon the identity matrix, it 
will be transformed to the matrix B such that B J A =I1. The inverse of A can 
therefore be obtained by performing Jordan elimination on the matrix M = 
A ~ 1 so as to reduce the first »(A) columns to the identity. The last »(A) 
columns are then the inverse of A. 


(a) Write a program to determine the inverse of A by Jordan elimination. 
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(b) The sequence of operations which reduce the ‘th column of A to e! is called 
the /th step of the process, and the/th diagonal element at the beginning of 
the ith step is called the ith pivot element. Modify the program of part (a) 
so that each step is preceded by a column permutation which yields the 
largest (in absolute value) pivot element possible. This modification tends 
to reduce the accumulation of round-off errors. 
(c) In the Jordan elimination of part (a), it is unnecessary to store the identity 
matrix explicitly, and, since the ith column is first affected at the /th step, 
only one new column need be brought in at each step. Moreover, the ith 
column of A may be discarded after its reduction to e’ on the ith step, and 
it is therefore necessary to store only a square matrix at all times. Show 
that by shifting all columns to the left and by moving the rows upward 
cyclically, a very uniform process results, with the pivot clement in the 
leading position at every step [Iverson (1954) or Rutishauser (1959)]. 
Write a program for the process. 
Modify part (c) to allow the choice of pivot elements as in part (b). The 
effects of the permutation on the not explicitly recorded identity cannot be 
indicated directly, but the performance of the same set of permutations in 
reverse order upon the rows of the resulting inverse produces the same 
result. Verify this and program the process. 


(d 


~~ 


1.41 (a) Show that a group [Jacobson (1951)] can be represented by a square 
matrix M such that each row and each column is a permutation vector. 
(b) Show that M* = M, = i! for some i. 
(c) What are the necessary and sufficient conditions that the group repre- 
sented by M be Abelian? 
(d) Write a program to determine all cyclic subgroups of a group repre- 
sented by M. 
1.42 If Uisa logical matrix whose rows are each nonzero, mutually disjoint, and 
collectively exhaustive (that is, (+/U ~- €) = e, and +//U = e), then U defines 
an m-way partition of n, where m = (U), and na = +(U). The partition is more 
commonly represented by the vector p = +/U [Riordan (1958), p. 107]. Clearly 
+/p =n. Write a program to generate 
(a) all partitions U of a given integer #. 
(b) all distinct partitions of », where U and V are considered equivalent if 
p = +/U isa permutation of q = +/V. 
1.43 Let x bea space vector (.e., of dimension three), and let R(x) be the square 
matrix tf (€ 9 x). Show that 
(a) +/R(x x vy) = (*¥ Ly) x € 
(be, (x xy) =x ly 
(c) (+/R(x x ¥)) L (Ww x &) = (% Oy) x (Ww YL 2) 
(d) (x oy) x (* Ly) = (Ce XY) Le KH) + 21x x Ty) 2 CL ® & Te). 
1.44 Let x-y =(7x x ly) —({¥ x Ty) be the vector product of x and y for 
vectors of dimension three. Show that 
(a) this agrees with the usual definition [Margenau and Murphy (1943)]. 
(b) x-y = —(y- x) 
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(c) *-y is perpendicular to x, that is, x {| («-y) =0. (Use the fact that 
|x = 2f* for a vector of dimension three.) 


= x oy 
f = + he /eng tx4 = —_*"_bet i 
1.45 Let [x] \ (x © x) be the length of x, let x yy area be the cosine of 


the angle between x and y, and let x oy = v1 — (x+y)? be the sine of the angle. 
Use the results of Exercises 1.43 and 1.44 to show that for space vectors 
(a) [«-y] = [*] x [y] x (wo y). Note that [x - 9] is the area enclosed by the 
parallelogram defined by « and y. 
(b) (wy) 2 = (WB) xy —(y fz) xX %& 
(c) (Woy) BSH L(V: 2). 
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MICROPROGRAMMING 


The algorithms to be executed by an automatic computer must be 
described in the restricted set of operations (called instructions or com- 
mands) provided in a given computer, and an algorithm so described is 
called a computer program. Since computer instructions are relatively 
complex, they may be described in turn by microprograms employing 
more elementary operations. 

Microprograms may be used to define a computer instruction set for the 
programmer, to define the detailed algorithms by which the computer 
circuits produce the operations of the instruction set, or for a variety of 
other purposes. In the design and development ofa computer, for example, 
it is important to maintain precise and complete communication between 
the computer programmer, the computer (or sysfem) designer, and the 
logical circuit (or hardware) designer. The system designer will, in fact, 
ordinarily begin with a description at the programmer’s level and proceed 
through increasing detail to the hardware designer’s level. Meanwhile, the 
programmers concerned with evaluating potential performance and with 
developing systems of metaprograms (so-called automatic programming 
systems) should be enabled to follow and to influence the evolving de- 
sign. 

The use of microprogramming will be illustrated by a description of the 
IBM 7090 computer (to be called the 7090) at a level approximately suited 
to the programmer and the system designer. The final section treats 
some problems in the extension to the hardware design level. 

The programs together with the lists of operands constitute a self- 
contained description of the 7090 which, to readers already familiar with 
computer organization and with the relevant sections (1.1 to I.1] and 1.14) 
of Chapter ], should prove readable without reference to the text. The 
text serves only to elucidate the microprograms and does not treat all of 
the instructions described by them. Tables 2.1, 2.13, and 2.14 summarize 
the dimensions, format, and significance of the various operands and 
should be consulted as each is first encountered. 0-origin indexing will be 
used throughout. 
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2.1 INSTRUCTION PREPARATION 


The operation of an automatic digital computer splits naturally into two 
phases which normally alternate: the instruction fetch and preparation and 
the instruction execution, The former involves the selection from some 
information store (memory) of the next instruction to be executed, its 
transfer to one or more control registers, and perhaps some modification 
of the instruction introduced into the control registers through so-called 
indexing, indirect addressing, or relocation. The execution phase begins 
with the decoding of the operation code segment of the instruction in the 
control registers to select the particular execution microprogram to be 
employed, and continues through the execution of the selected micro- 
program upon variables in certain central registers and in certain memory 
registers determined by the address portion or portions of the control 
registers. 

The main memory of the 7090 will be denoted by a logical matrix M of 
dimension 2!) x 36. Selection from M is limited to the selection of a row 
M’; each such row is called a word, and M’ is called word i or register i. 


Dimension 


Memory M 2" xX 36 
Index accumulators I 3.x 15 
Sequence vector S (instruction counter) 15 
Command vector c 36 
Upper accumulator u (s,qg,p,1,2,..., 35) 38 
Lower accumulator (Quotient register) LESS Te Qe aah GO) 36 
Upper accumulator overflow u 
Lower accumulator overflow / 
Trapping mode indicator t 


Instruction fetch mode J 41: skip channel trap 


0: normal 
2: skip trap and fetch phase 


0: no indexing 

Indexing class k%c) 41: normal indexing (15 bit) 
2: restricted indexing (9 bit) 

, : {O: no indirect addressing 
‘ 5 ol: Af ge 

Indirect addressing class AMc) Viectnalivest addressing 

Console start signal (run) r 

Binary representation of = e(2) 


Table 2.1. Central computer operands 
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Each instruction is a full word selected from M, and the sequence in 
which instructions are selected is controlled by a control register called the 
instruction counter or sequence register. This register represents a logical 
vector s of dimension 15 whose base two value determines the word / to be 
selected in the next instruction phase. The quantity* | s is incremented 
after each instruction fetch and therefore selects instruction words in 
natural ascending sequence. The value of s may, however, be respecified 
by the execution of certain branch instruetions. 


c<M-S 


J's <2 11 +s) bo 


Program 2.2 Basic instruction fetch 


The current instruction will be denoted by ¢. It is stored in a 36-bit 
command register. 

The basic instruction fetch involves only the variables M, s, and c, and 
is described by Program 2.2. The second step shows that the inerementa- 
tion of | s is redueed modulo 2! and that the selection of instructions 
from the 2 word memory is therefore cyclic. 


Additive indexing 


It is convenient to the programmer to be able to add (or subtract) an 
index quantity / to (from) the address portion of an instruction in the 
command register ¢ before its execution. This quantity is represented in 
base two by a logical vector @ and is stored in a special index register. In 
the 7090, the data address portion of ¢ is the fifteen-bit suffix w!?/c and the 
indexing is subtractive: 


w/e 215 


(.wh/e — a). 


The reduction modulo 2! again indicates cyclic treatment of addresses. 

The 7090 contains three index registers or index accumulators which may 
be used independently or jointly. They will be denoted by the index 
matri FE of dimension 3 xX 15. One or more (or none) of the index regis- 
ters I’ are selected according to the value of the vector i = (18 | «)/c, the 
three-bit index tag portion of the command, as follows: 


P/E 2P 1 ( wbP/e — | (((18 | @)/e) : D). 
The address in the command register is clearly decremented by the base 


* Since number bases other than two will be used but rarely in the present chapter, 
the elided form = x will be used instead of (2e) __ x. 
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two value of the vector obtained by oring together the selected rows of I. 
The oring of the index accumulators permits simple circuits for their 
selection. It is, however, of questionable value to the programmer, and 
the 7090 index registers are normally used individually. 


Indirect addressing 


It is often convenient (as in the execution of a permutation) for a 
programmer to specify a data address indirectly, i.e., to specify the address 
of the word containing the desired data address. Such indirect addressing 
could proceed through several levels, but in the 7090 it is limited to one. 


Cio A C13 : 0 


wi8/c <— wl8/M- w/c 


Program 2.3 Indirect addressing 


Only the last half of c is respecified by the corresponding portion of the 
selected word, as described by Program 2.3. The occurrence of indirect 
addressing is determined by components ¢€,, and ¢€,, of the operation 
code. 


Dynamic relocation 


The correct execution of any computer program requires that each 
instruction and each operand be stored in the register assigned in the 
construction of the program. A program can, however, be relocated by an 
integral amount n if each word originally assigned to address / is assigned 
to address / + n, and if each address in the program is also incremented by 
n. The incrementation of program addresses can be performed explicitly 
by an assembler or other metaprogram, or it can be performed dynamically 
by an additive index register containing the number #. An index register 
employed exclusively for this purpose is called a base address register. 

More generally, the provision of a table of base addresses permits 
independent dynamic relocation of different blocks of a program, where 
each block is confined to a set of successive registers. This is equivalent to 
one-level indirect addressing in which a certain portion of the address 
(e.g., v/(w?/c)) selects from memory one of a table of base addresses to 
respecify the same portion v/(w!*/c) thus: 


iy, 1D} 5 viol 3/e 
va? /e) — ow? /M MOT), 


If, for example, v = a’, and the format is otherwise as in the 7090, then 
columns 21-27 of registers 0 to (2° — 1) provide the base addresses for 
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suecessive blocks of 2° registers each. The 7090 provides no dynamic 
relocation. 


Branching, interruption, and trapping 


The normal sequence of instructions (fetched from successive memory 
registers) can be interrupted by respecifying the sequence register s. Such 
respecification is performed in the execution phase of certain instructions, 
primarily those called transfers, and skips. The simplest branch ts the 
TRA* (transfer), whose execution effects the following operation 


s<w/e, 


The normal sequence can also be broken by the insertion of an instruc- 
tion in the command register without disturbing the sequence register. 
Unless the inserted instruction is itself a branch, the normal sequence is 
resumed immediately. 

If just before a branch (or insertion) the present value of s is stored in 
some chosen memory register /, then the data in register can be used in a 
subsequent branch to reestablish the original sequence at the point reached 
before the first branch. The storage of s and immediate branch are jointly 
called an interruption. An interruption which is performed automatically 
upon the occurrence of certain special conditions is called a trap. A trap 
provides a convenient device for inserting in the normal program sequence 
a subprogram demanded by the occurrence of the special conditions. 

In the 7090, the so-called channel trap 1s controlled by an 8 x 3 logical 
matrix T whose elements are determined by three different conditions 
existing in each of the 8 iaput-output channels of the computer. A corre- 
sponding enable matrix E (also 8 x 3) and an enable toggle e determine 
which elements of T are effective. 

The channel trap is effected in the first phase of the instruction fetch 
(Program 2.4) as described by steps 2-8. Ifthe matrix eE A T is zero, the 
branch on step 2 skips the trap operation and begins the normal fetch on 
step 9. If not, step 3 determines / as the index of the first nonzero row, 
step 4 stores s in a memory register determined by /, and step 5 stores the 
nonzero row (which indicates the particular condition causing the inter- 
ruption) in another portion of the same register. Step 6 resets the indi- 
cators which occasioned the trap. Step 7 resets the enable toggle e and 
hence (as is clear from step 2) prevents the occurrence of further traps 
until e is again set to one by the execution of a special enable instruction 
ENB. (The reset of e to zero prevents the uncontrolled interruption of 
interruptions.) Step 8 performs the actual insertion by transferring to the 


* 7090 instructions will be referred to by the mnemonic codes used in the /BM Manual 
(1960). 
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command register the content of a second memory register determined by 
j. The sequence register is undisturbed. 

The 7090 can also be operated in a special trap mode which effectively 
converts all transfer instructions (but not skip instructions) into inter- 
ruptions. Discussion of this type of trapping will be deferred since it is not 
relevant to the instruction fetch phase. 


Complete instruction fetch 


The complete instruction fetch comprises three phases: channel 
trapping, the fetch proper, and the instruction preparation by indirect 
addressing and indexing. They are described by steps 2-8, 9-10, and 11-18, 
respectively, of Program 2.4. 

Certain of the three phases may be skipped according to the setting of 
the fetch mode indicator f. In the normal case (f = 0) none are skipped. 
If f = 1, the trap phase only is skipped. This case occurs after execution 
of an instruction such as the RDS (read select), which must be followed by 
a certain auxiliary instruction within a fixed time limit. If f= 2 (a case 
which occurs only after execution of the XEC (execute) instruction), the 
trap and the fetch proper are both skipped, and the command already in ¢ 
is merely prepared by indexing and indirect addressing. In every case, f 
is reset to its normal zero value by step 11. 

Not all instructions are subject to indexing. The indexability of a 
command cis determined by a c/ass function kc), which assumes the values 
0, 1, or 2, according to whether c is subject to no indexing, normal indexing 
(affecting all fifteen bits of the address), or restricted indexing (affecting 
the last nine bits of the address), respectively. This behavior is determined 
by the branch on step 13. 

A second class function k!(e) determines whether the instruction c is of 
a type subject to indirect addressing. Actual indirect addressing of any 
particular instruction of the appropriate type is initiated by the configura- 
tion Cy. = 1 and ¢,, = 1. The function A1(c) is itself independent of c,, 
and ¢)5. 

The class functions kc) and &'(c) will themselves be specified by 
prefacing the mnemonic code of each instruction described by a pair of 
digits. Thus, 

11 CLA 


indicates that clear and add is subject to both indexing and indirect 
addressing, and 
10 CHS 


indicates that change sign is subject to indexing but not to indirect ad- 
dressing. 
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wl 


nN 


eEA T:0 = 
pr tel A (eE AT) 
M102) <— q!5(36)s 

(15! o8)/MNO12) «- (eE A TY 
T) <0 

e <0 


fo ee M112 


c«M:5 
s< 21) /(] + J 8) 


ee ee 


. 
Oo ‘oO oO ~ a ar ao ies) 


12 a<- k\(c) A CypACy3 
13 Oke) I 
< > 
14 Lowlife 215] (i w/e — GY D) 
15 | w/e <— 29| Ci we — - i ¥ w/1)) 
16 a:0 alee 
17 w!8/e <— wl8/M:! wife 
18 Uj a<-0 


i = (18 | a)/c 


Program 2.4 Complete instruction fetch 


The phases of instruction preparation are performed in the following 
order: 
indexing (if indicated); indirect addressing (if indicated). 
Moreover, if indirect addressing is performed, the new address ts itself 
re-indexed (if indicated). As shown by steps 12, 16, and 18, the indirect 
addressing is limited to a single level. 


2.2 INSTRUCTION EXECUTION 


The execution phase begins with the “decoding” of the operation part 
of the command c to select the appropriate microprogram to be executed. 
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Except for the format of the operation code* (which in the common case 
occupies the prefix a!*/c) the details of the decoding are, however, of no 
interest to the programmer, and attention will be confined to the execution 
microprograms. These may be grouped into a small number of families; 
for the 7090 they are /oad and store, branch, logical, arithmetic, shift, 
convert, and input-output. 

Certain of the arguments and results of the computer instructions are 
represented by three central data registers to be denoted by J, wu, and d. 
The registers uw and I serve as accumulators in the addition and other 
arithmetic operations, and, since wu and I jointly represent double precision 
numbers (i.¢., Carries are in some operations propagated between the high 
order end of J and the low order end of w), they will be called the upper and 
lower accumulator, respectively. Since I receives the multiplier in a 
multiplication and the quotient in division, it is called (in the 7090 manual) 
the Multiplier-Quotient or MQ register, and the letter Q occurs in the 
mnemonic code for instructions affecting it. 

Signed numeric quantities are represented in base two with the sign in the 
first component, i.e., register / represents the quantity y = (1 — 2 M,') x 
(| @!/M‘). The lower accumulator I is, like each memory register, of 
dimension 36, and the sign of a numeric quantity is represented by J). 
The upper accumulator is of dimension 38 and represents the number 
(1 — 2uy) x (| at/u). The two extra components u, and ty are called 
overflow positions and are excluded from normal transfers of data from wu 
to the memory. The component uy (called the p-bit) is, however, included 
instead of the sign bit u, in certain /ogical instructions. The component 
u, (called the g-bit) is made accessible only by certain shift operations. 

The register d (distributor) serves only as intermediary in transfers 
between main memory and the central data registers u and J and is not 
accessible to the programmer. 


Load and store 


In each member of the family of basic load and store instructions 
(Program 2.5), the memory word involved is selected by w/e, the address 
portion of the instruction. The instruction STA stores only the address 
part of u, and STD stores the decrement part, so called because it is used 
in certain instructions to specify the amount of decrement to be applied to 
an index register. The STP stores the p-bit and the first two digits of the 
magnitude part of u; that is, the three-bit prefix of the /ogical part of u 
which enters into logical operations. The STO instruction stores the 


* The opcration code representing instruction x is a logical vector to be denoted by 
e(x). Thus e(CLA) = (000101000000). 
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Load Q lt LDQ —+} 1<-M ele ia 
Store Q 11 STQ —sS Mee «oT |__s, 
Store left half Q Hl SLQ — + al8/M’ w/e 6. g lB i] ener 
Store 11 STO —>» M.#%le — (| @)/u pare 
Store zero 11 STZ — Mele 0 faa 
Store address I} STA —3 wh/M eM wl /u bos, 
Store decrement It STD —} (3) a!5)/M wie (5 | al)/u - > 
Store prefix tt STP —a o8/M elle < (2 | o8)/t | 
Store tag 1] STT (18 | a3)/M-e le «(20 | o3)/u Ls 
Store instruction 

location counter 11 STL wl5/M | wlio 6g Resear 
Store logical word 11 SLW M &le Siu |» 


Program 2.5 Load and store instructions 


normal numeric part of w (that is, all but the overflow bits), whereas SLW 
(store logical word) stores the p-bit instead of the sign. 

The instructions which load and store the index accumulators (Program 
2.6) are of four types, as indicated by the leading letter of each of the 
mnemonic codes, L for load index from memory, S' for store index in 
memory, A for load index from the address of the command register, and 
P for place the index in the upper accumulator or the upper accumulator 
in the index. The portion of memory, command register, or upper 
accumulator involved in each of the ten instructions which specify the 
index is shown in steps I-10. The last five of these differ from the corre- 
sponding members of the first five only by complementation on 2", as 
shown in step 11. Since the subtraction occurring in indexing (step 14 of 
Program 2.4) is reduced modulo 2”, the effect of complementation is to 
add rather than subtract the quantity used to load the index accumu- 
lator. 

Step 12 shows that the index accumulators specified are selected by the 
three-bit tag vector #=(18 | @*)/cand that each receives the same specifying 
quantity. Since the tag vector is used to select the index registers to be 
specified, it cannot also be used to specify indexing of the instruction 
itself, and, consequently, none of the load and store index instructions are 
indexable. Neither do they permit indirect addressing. 
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| 


Load index from address 00 LXA <— wld | w/e I 


Load index from 


decrement 00 LXD — *+-(34 aS)/Mi etic 2 
Address to index true 00 AXT — * <w/e |_| 3 
Place address in index 00 PAX 4 x < wll Met 4 
Place decrement in index 00 PDX — * <—(5 | a!)/t | § 
Load complement of 

address in index 00 LAC «- wh5/M.w/e H— | 6 


Load complement of 


decrement in index 00 LDC x (31 ad) Miele |} 7 


Address to index 


complemented 00 AXC x <-wlic |_|] 


Place address in index 


complemented 00 PAC x <wl/u LJ} 9 


Place decrement in 
index complemented 00 PDC 


Pit. Mt 25 Dl 


x <- (5) ol) /u 


Je <= 215 | (— 1 x) 


fe 
<i i//T «-e( 4/8) 2x eam eee 
Store index in address 00 SXA wl /M ele iV I |, 13 


Store index in decrement 00 SXD —} (34 el )/MePe <i TL 4 14 


Place index in address 00 PXA yt < w!(38)\@ 5 1) >» 15 
Place index in decrement 00 PXD —¥ t <-(5j 09(38))\G: 1) -—> 16 


(i = (18 | a/c) 


Program 2.6 Load and store index instructions 


The last four steps show the storing of the index accumulators. The 
quantity stored is the or function of the accumulators selected by the tag 


(18 | a&)/e. 
Branch instructions 


The basic branch instructions are of two main types, the transfer 
.denoted by a leading Tin the mnemonic code) and the skip. The behavior 
of the skip instructions is shown in steps I-10 of Program 2.7, and is 
typified by the PBT (p-bit test) of steps | and 10. If the p-bit of the upper 
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accumulator is not zero, the sequence vector is incremented so as to skip 
the next instruction in the sequence; if the p-bit is zero, the instruction has 
no effect. The various skip instructions differ in the particular tests made, 
and the last two (CAS and LAS) differ also in providing three alternatives, 


p-bit test 10 PBT _ S< thy tI 
Low order bit test 10 LBT S4> Uy, — 2 
Storage zero test 11 ZET — ys eA Mele — 3 
Storage nonzero 11 NZT—3 »«-@% Miele —- 4 
test : 
Compare accum- 11 CAS — d«.M- ele 5 
ulator with storage . x 
s<-u \ (Cloyd) + 2(u Aa!) A (d 46) 6 
s<-2(1 — 208,) 2 fee CL = 2d) afd) 7 
Logical compare 11 LAS 4] d«- @/u— 1M le 8 
accumulator with 
storage s< (4 0) + Gi 0) 9 
J Ls<- 2+ 18) r— 10 
Execute IL XEC—) com ele ul 
f[<-2 lL—> 12 
Store location and 00 STR —» w/M%<—s 13 
trap 
$<-2 L—» 14 
Leave trapping 10 LTM—+4 7<-0 > 15 
mode 
Enter trapping 10 ETM—+» 7¢<-l H—> 16 
mode 
Trap transfer 11 TTR —>+} s<wl/e —»17 
Console clear or —y (fiuha—-0 —> 18 
reset 


Program 2.7 Skip type and other special branches 


skipping 2, I, or 0 instructions according to whether the quantities com- 
pared stand in the relation <, =, or >, respectively. 

When operating in the nontrapping mode (¢ = 0), the essential operation 
of the transfer type of branch (Program 2.8) is the (conditional) respecifi- 
cation of the sequence vector s by the address portion of c. The HTR 
(halt and transfer) also suspends operation of the computer until a run 
signal is received from the console (steps 27-29). 

In the trapping mode, all transfer operations are converted to inter- 
ruptions; the sequence vector is first stored in register zero, and a 
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Transfer on index high 00 TXH —— 6<-(L@ x DT 1G 4 a)/e) 


Transfer on index low 00 TXL ——» 6<-(L@ ¥ 1 1.G Lae) 


or equal 


Transfer on index 00 TIX 


Transfer on no index 00 TNX —> 


Transfer with index 00 TXI 


incremented 


Transfer and set index 00 TSX 
Transfer ll TRA 
Halt and transfer 1] HTR 
Transfer on zero Il TZE 
Transfer on nonzero 11 TNZ 
Transfer on plus 1] TPL 
Transfer on minus 11 TMI 


Transfer on overflow I! 


Transfer on no over- It 
flow 


Transfer on Q plus 1! TQP 
Transfer on Quotient Il TOO 
overflow 

Transfer on low Q {1 TLQ 


(G18 a)/c) 


— 


b< (LEY D - 1G pate 
yl GX D—16 lL arye 
LiT<— 2° |ne(+/p 

b<«-(1 GX D1 GB Late) 
yo (GY D+ 1B layje 

| t//T«- QP {ye(t /i) 

J i//1<-2?\(— 1 s)e(+/i) 


—> b<-1 


b«-(alju) / 


b« (ai/us) Y. 


mi 


mi 


bx thy 
by 
(6, w) <- Ww, 0) 
(6, 1) «~ (aa, 0) 


b«- i, 
(b, 1) <- V0) 


b<-((L — 2hy) Latfl-< Cl — 2uy) | ot /ts) 


bby (Naya (uh &)) 
tid 

56:90 

S<-whle 

w>/M" <— 

56:0 
js«-l 
a/c : p(HTR) 
r<0 


ri 0 


Program 2.8 Basic transfer type branches 


2 
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(conditional) branch is made to register one. This behavior is uscful in 
trace programs. The conditions for the various transfers are indicated 
in the setting of the logical variable b. Special indicators (such as the 
overflow toggle w) are resct by the transfer instructions which they control. 

Transfers based on the condition of the index accumulators (steps I-9) 
are combined with modification of the index registers. The quantity from 
the index accumulators is again the or function of the accumulators 
selected by (18 | &)/e. This quantity is compared with the decrement part 
of the command (that is, (3 | «!5)/c)to control the conditional branches and 
is decremented or augmented by the decrement part to modify the selected 
index accumulators. 

The TSX (transfer and set index) inserts the complement (on 2!>) of the 
sequence vector into the selected index accumulators before effecting an 
unconditional transfer. This instruction is convenient for incorporating 
closed subroutines or other interruptions, since a subsequent TRA 
(transfer) with a zero address and indexed by the same index register 
restores the program sequence to the point of interruption. 

As shown in Program 2.7, only the TTR (trap transfer) is exempt from 
trapping. The trap indicator is set by the ETM (enter trap mode) and is 
reset by the LTM (leave trap mode) as well as by a console clear or reset. 
The XEC (execute) instruction performs no operation upon the central 
data registers but inserts in the normal instruction sequence (without 
breaking it) the instruction in the register specified by the data address 
aceompanying the XEC. This is effected by simply loading the specified 
register into ¢ and (by setting f = 2) skipping the trap and the fetch proper 
of the instruction fetch phase. 


Logical instructions 


The logical operations (Program 2.9) concern the logical part of t, 
which differs from the numeric part by including the p-bit rather than the 
sign, and hence comprises a@?/u. The first instruction of the family (ORS) 
produces the /ogical or of the word selected from storage with the vector 
@’/u and returns the result to the same location in storage. The instruction 
ANS (and to storage) is similar. In the ORA (or to accumulator) the 
result u is of dimension 38, and the second operand (that is, M °'”!*) is 
expanded to this dimension by inserting two leading zeros before oring 
it with w, The instruction ANA is similar. It is easily verified that ORA 
leaves the extra bits unchanged and that ANA resets them. The ERA 
(exclusive or to accumulator) is anomalous in that it resets the extra bits. 

The ACL (add and carry logical) is normally used only in certain parity 
check algorithms. It adds the selected word to the logical part of w, 
treating both as 36-bit positive base two numbers, again adds any resulting 


84 Microprogramming §2.2 


Or to storage I} ORS —} Mele < (@2/u) yy Melle Ly 
Or to accumulator Il ORAS} uweu y (A\M ee) La 
And to storage 1} ANS —} M-e/¢— (@/u) A, Mie¥le Lt 
And to accumulator 1] ANA —y w<eu A (@\M ee) eee 


Exclusive or to accumulator 1! ERA - u<—(u ¢ (@\M #"/e)) 


a/u —0 /— 
Complement magnitude 10 COM — a@lju <al/u |_> 
Clear magnitude 10 CLM = al/u <0 = 
Change sign 10 CHS Uy <— Uy [—> 
Set sign plus 10 SSP a uy, <0 |__» 
Set sign minus 10 SSM —4¥ uy <1 |—> 
Store logical word IL SLW — MLe% le eu — 
Clear and add logical Il CAL u <~@\M Lovie leas 
Add and carry logical i} ACL zi y — 1 (@/u) + 1M Lele 


La? <— (2 |y) + (y 2 2) [> 


Program 2.9 Logical operations 


overflow to the low order end, and places the result (which will not exceed 
25 — |) in the logical part of u. The behavior of the remaining logical 
instructions is evident from Program 2.9. As shown by the class functions 
k° and k!, five of them do not permit indirect addressing. * 


Arithmetic instructions 


The description of arithmetic instructions will be illustrated by the family 
of fixed pointt add instructions shown in Program 2.10. The CLA (clear 
and add) transfers the selected memory register to uw, resetting the two 


* Since each of these five instructions involves the accumulator only, the normal 
address portion w!*/e does not represent an address, and its last three components are 
used in representing the operation code itself. The possibility of indirect addressing 
would suggest to the programmer the nonexistent possibility of specifying an arbitrary 
indirect address tn w!?/c. 

+ The 7090 incorporates three arithmetic functions: addition, multiplication, and 
division, each of which may be performed in either a fixed (radix) point or floating point 
mode. 
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Clear and . 

add 11 CLA u<—(] BM eric L—> | 

Clear and | 

subtract 11 CLS | u<-(| @)\(al M2.# ie) — 

Add 1) ADD— d<—M. ele 3 

Subtract It SUB —} d <—(al 4 M.e¥/) |_| 4 

Add 

magnitude 11 ADM | d~a@aM ele 5 

Subtract 

magnitude 11 SBM —y dol y Mele 6 

Round lt RND— d <u, y io! 7 
2 (1 — 2ttg (au) + (1 — 2d,)(.-@/d) 8 
Ue Uy 9 
Bue <— 287 | (|2|) 10 
warn ((v # Uy) A(Ug = dy)) 1 

|“ <—(@ < 0) Vv (ty Al = 0)) 12 


Program 2.10 Add instructions 


overflow positions. The CLS (clear and subtract) differs only in that the 
sign is reversed in transfer. 

The instructions ADD, SUB, ADM, and SBM each transfer the 
selected word to d with an appropriate sign, add it to the number repre- 
sented by wu (including the overflow positions), and place the sum reduced 
modulo 23’ in u. The sign of a zero result is (as indicated by step 12) the 
sign of the number originally contained in u. 

The overflow indicator wu is set only by a carry (but not a borrow) from 
u, to u,. This indicator controls, and is reset by, certain branch in- 
structions. 

The RND (round) instruction is used to round up the magnitude of the 
number represented jointly by the upper and lower accumulator by one 
unit in the high order position of the lower accumulator. As shown in 
Program 2.10, the content of the upper accumulator only is actually 
affected. 


Shift instructions 


In describing computer instructions, the term /eft shift of a vector « by 
places refers either to the left rotation «<r {* or to the left rotation 
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combined with a reset to zero of the last r “‘vacated positions,” that is, 
x<—(rf x) Aw’. 


Both types of shift occur in the 7090 and concern various portions of the 
entire accumulator wu @ I, as shown in Program 2.11. The portion affected 
is determined by the mask vector m. 


uu V (a + (mj(u oD) 
miu OD) <-(¢rt@ni(u © dD) Aw 


Accumulator left 


shift 10 ALS m < (38) © €(36) 

Long left shift 10 LLS m «- (38) © a1(36) 
uy, << l, 

Logical left shift 10 LGL m ~«- @(38) © €(36) 

Accumulator right 

shift PO ARS «- @(38) “> €(36) 

Long right shift 10 LRS — ™ < @(38) © a(36) 
ly <- Uy 


Logical right shift 10 LGR —y¥ ™m ~ @1(38) © €(36) 
4mi(u ob)<-(rj{ (mu Ob) Aa 


Rotate MQ 10 RQL — i<rfl |__». 
Exchange accumu- _ 
lator and MQ 00 XCA — (4 @)/te<> 1 
(| @)/u <—0 
Exchange logical 
accumulator and MQ 00: XCL wu I 
az/u <0 |_,. 
r= Lw/c 


Program 2.11 Shift instructions 


The first three instructions are left shifts. Each sets the accumulator 
overflow indicator if any nonzero bits are lost, i.e., if any of the first r 
positions of the affected portion are nonzero. The next three are analogous 
right shifts, which do not, however, set the overflow indicator. In the 
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“Jong” shifts LLS and LRS, one sign position specifies the other, although 
the sign positions are otherwise excluded from the shift by the mask m. 

The LGR shifts all positions save the sign of u; RQL rotates MQ 
without resetting any positions; and XCA, which “exchanges” the 
accumulators, is effectively a rotation except that it resets the overflow 
bits. The amount of shift r is in each case determined by the base two 
value of w/c. 


Convert instructions 

Each convert instruction (Program 2. |2) selects a six-bit infix of one of the 
accumulators, adds its base two value to a “base address” initially specified 
by the address portion of the instruction, and selects the memory register 


<=1 Cy : 0 1 

4 Pol /d 2 

ie EG jo U0} a/c 3 
d<c 4 

— 5} 1-641 5 

; oe aa 6 

a 2\( wld + abl) | | 7 

d «Me 8 

alic : e(CAQ) ro 9 

| all aja 10 

Lo} aly — 237\(at/ue + d) I] 1 

00 CVR —> Jt = (10) @®)/e 12 
d<c 13 

pepe 14 

a <= 235|( i wl5/d + w/t) 15 

d<—M¢* 16 

aij <-~ (6 | (at/u)) A ae 17 

aS/a? jiu < (aS /a2/u) \) ab/d | 18 


Program 2.12 Convert instructions 
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specified by the resulting address. Part of the selected register is used to 
respecify the base address and part to modify one or other of the accumu- 
lators. The process is reapplied to successive six-bit infixes in cyclic order 
a number of times determined by the base two value of (10 | @)/e. If 
Coy = 1, the last fifteen bits of the last word selected in the operation are 
transferred to index accumulator J°. 


Input-output instructions 


Because the data transmission rates of input-output equipment serving 
a computer are much lower than those of the computer proper, computer 
systems are normally designed to permit concurrent operation of the 
computer and one or more input-output units. The units are therefore 
more or less autonomous. 

In the 7090, the autonomy of input-output equipment is carried further 
by providing eight data channels which transmit data to, and are controlled 
by, the computer proper, and which in turn control and accept data from 
the individual input-output units. The entire input-output process there- 
fore comprises three levels of microprograms: a semiautonomous input- 
output unit controlled by a semiautonomous data channel controlled by 
the computer proper. 

Attention will be restricted to the magnetic tape input-output units of 
the 7090. Each unit is available to one specific data channel / (for f= 
0 — 7), and a particular unit can be characterized as the file D,’. The unit 
is completely autonomous only in certain simple operations, such as 
rewind, write end of file, backspace, and continue to end of record. Except 
for these special operations, a given data channel can control only one of 
its files at a time. The eight data channels may, however, operate con- 
currently under control of the computer proper. 

Each channel / behaves as a subcomputer with its own sequence vector 
S’, command vector C’, data register D’, and other miscellaneous operands, 
as Shown in Table 2.13. The instructions of the subcomputer (listed in the 
matrix K) are called channel commands and differ from the instructions of 
the computer proper in both format and function. 


Tape Units. Each tape unit behaves as a file (Sec. 1.22); each recorded 
component is an alphanumeric character represented in a seven-bit odd- 
parity error-detecting code, the lowest level partition A, is represented by 
the intercharacter space on the tape, the second level partition A, (called 
an end of record gap) is a longer blank space on tape. 

Each record gap is immediately preceded by a parity check character 
which is appended to the normal data of the record to permit an even 
parity “longitudinal” parity check on each of the seven-bit positions of the 


Dimension 


Channel data registers D 8 x 36 
Channel sequence vectors § 8 x 15 
Channel command vectors C 8 x 36 
[ 1, : End of file A, 
Channel trap rT: T,’ : Parity check 8 x3 
{ T,? : Channel command 
Channel trap enabled E 8 x 3 
Channel trap enabled e 
Tape position limits L_ L*: (Beginning, End) 8 x2 
Limit position on tape vy (Determined by reflective marker) 
Busy indicator b 8 
Write or read indicator w 8 
Tape unit index t 8 
[° : Normal read-write 
Functions f j 1 : Backspace record or write end of 
file 8 
| 2 : Backspace to file mark 
3 : Rewind 
Load channel waiting r (reload) 8 
Write record gap (A,) next g 8 
Current character X X’ js the 7-bit representation 8x7 
Current parity check Y 8x7 
Interlock vector x sw, = 1 if character X* is loaded 8 
Current character selector WV  V*/D? is current character 8 x 36 
End of file indicator Q Q* : (Counter, Potential error) 8 x2 
Input-output indicator h 
c d\ IOCD 
- -—\|TCH 
rp |IORP 
Channel commands K rt HORT 
c p | LOCP 
et | IOCT 
s p} IOSP 
s t/ IOST 


Table 2.13 Channel operands 
89 
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preceding record. The check character is recorded automatically and when 
read from the tape is used in the parity check but is not transmitted with 
the data of the preceding record. 
The third and highest level partition (called end of file) is represented by 
a special recorded character A, which has the seven-bit representation 
e(A,) = (0, 0,0, 1, 1,1, 1). It is recorded together with the appropriate 
check character (which, since the check is of even-parity is also A,) as a 
separate record. The character A, alone is not recognized as an end of file 
partition; only the sequence Ay, Az, Ay, Ay is so recognized. Tapes are 
normally stopped only at a record gap so that, on restarting, the tape ts 
fully accelerated before the end of the record gap (and hence data) is 
reached. 
Dimension 

Character buffer 

Partition buffer 

Logical association (connection) 

Busy indicator 

Write-read status 

Function status 

End of file counter 


8 x # of units per channel 


yy So DWN 


Ay intercharacter gap 
File partitions A{A, inter-record gap 
a, end-of-file symbol (0001111) 


Table 2.14 Input-cutput unit operands 


The tape unit parameters are listed in Table 2.14, and the operation of 
tape unit @,’ is described by Program 2.15. The unit idles at step 5 until its 
busy indicator B,’ is turned on by data channel i. After a starting delay of 
about 650 microseconds required to accelerate the tape and reach the 
beginning of the record, one of four functions (listed under f in Table 2.13) 
is performed as determined by the function indicator F,’. 

If F,’ = 0, a normal read or write is performed under direct control of 
the data channel as detailed in steps 18-37. If F,’ ./ 0, one of the several 
completely autonomous operations is initiated. If F,’ is not zero and 
W,’ (write indicator) is unity, the autonomous function write end of file 
is performed by steps |-3, after which the busy indicator is turned off and 
the unit returns to idle status. The end limit indicator L,' is set by step 3 
if the tape position exceeds a limit » set by a reflective marker attached to 
the tape a short way before its extreme end. 

If W,’ = Oand if F,’ = 1, 2, or 3, the unit backspaces to the next earlier 
record gap, to the next earlier end of file position, or to position zero, 
respectively. The last is called rewind. If, in backspacing, the tape becomes 
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10 


®,! 
®,? 
L,? 
B;i 
B,i 


Ao, Ag 

<~Aeg, Ay 

<— Ly! y (n(®,*) > 9) 
«-0 
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starting delay 


F;i 
Ww; 
R,! 


n(®,*) : 0 


L,? 


i 
Z;', 


R; 


Rj! —(Z;' = Ay) x (Rj — 1) 


P,! 
F;! 
R; 


: 0 
: od 
<3 


+ Ly’ v (F;' 4 3) 
P; +10; 
: 0 


2 Ag 
: 2 
: J 


& 


Loy 


Di <P; 

X? « 9(Z;') 
x,<1 

Pi td, 

record gap delay 
A; : 1 

e(Z;") — X* 
x,<1] 

D,' <- Z;', Ay 
L,' Ly’ v @@,;') > ») 
g;, : 90 

e(Z;*) — ¥? 

Df <- Z,4, A, 
g,<- 90 

record gap delay 
A; : J 


Program 2.15 Tape unit ®,’ 
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Ty’ + T,t vy &; A(R, = 1) 


ki: p 


47;A(Ry =t): 1 


(b;, A;) <—0 

(b; V B; ) : 1 

(BL, Fi,Wi, x) —(F, ws wd 
f,: 0 

delay 


r, -0 


(D; © Aj, ® Q*) — (Ag, 1, 3, 0) 
(¢; ®r; 2 Y) <-0 

Ci MLS 

Ls? —24i] + Ls 

a8/C* : e(TCH) 

Si? —wh/ci 


(Lal8/Ci = 217) y(Lal8/Ct = 6 x 215) 2 


Di~0 

Vi — a8 (36) 

(Ro Ar) ACB) ayCt Ne): 1 
(Ry # c) ACP; =%) : 0 

D, Ay 

x; : 0 

x; <0 

Yi —(¥! # X*) 

Ty Ty’ vy (=/X*) 

(S/K) AES 21 

Q,' : 0 

Qo’ — (XS p(y) x (Q,° — 1) 
Q,’ — Qy' V (Qy* > 0) 

(Q)' = 1) A(p; =A) : 0 

T,' <1 

Ty’ + Ty’ Vv (Q;' A(Qo? = 0)) 
Ey’ A Qy' A(Qy? =0) : 1 
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x, 0 
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DP; : 

Q? <— (3, 0) 

T+ Ty vy (v/¥4) 
(V/Y) AEE: 1 
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Q,' : 0 
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Vi: of 
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rewound before the appropriate partition is found, the process is terminated 
and the beginning limit indicator L,’ is turned on (steps 10-11). 

The file is read backward repeatedly by step 12. When a record gap 
occurs, step 16 is executed, and if F,‘ = |, the branch to step 4 returns the 
unit to idle status. If F,’ = 3, termination can occur only from step II, at 
which point the tape is rewound. The counter R," is used to detect an end 
of file partition. It is reduced by one (step 14) if the character read is A, or 
to zero if it is not. Since R,’ is set to 3 after each record gap, step 17 is 
reached with R,‘ = | if and only if the end of file sequence A,, Ay, Ay, A, has 
occurred. 


8 
< 
5 
J 
= 
1 
a 


i<(_QOf w/c) — 1 
bs 1 

t, — Lw/e 
fick -3(k =4) 
w,—(k =4) 


! 
0 Oo I A AU BR WN 


a 
°o 


b,<1 — 


Program 2.17 Instructions for special tape unit functions 


Before completing the discussion of the remaining functions of the tape 
unit, it may be helpful to follow through the entire process initiated by the 
BSR (back space record) instruction. The channel idles (Program 2.16) on 
step 5 with the busy indicator b, off. The BSR instruction (Program 2.17) 
first determines the index / of the channel addressed, waits on step 6 until 
the selected channel becomes free, sets the tape index ¢, = / to select the 
particular unit ),’, the function indicator f, to unity, the write indicator w, 
to zero, and the busy indicator b, to unity. This last action initiates oper- 
ation of channel /, which, as soonas unit D,‘ becomes free(B,’ = 0), executes 
steps 6-7 and then (since f, > 0) returns the channel immediately to idle 
status. Step 6 transfers to the parameters of the selected unit the relevant 
channel parameters which were themselves specified by the BSR instruction. 
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Step 6 also makes the selected unit busy (B,’ = 1), and hence starts it in 
operation. 

The normal read-write functions of the tape unit are described by steps 
18-37. They are initiated by a branch from step 7 in the event that the 
function indicator is zero. Over-all control by the channel is exercised 
primarily through the association indicator A,‘, which is equal to unity if 
unit / is logically associated with channel /, that is, if data are permitted to 
flow between them. If writing is in progress (W,’ = 1) and A,’ becomes 
zero, the unit stops; if reading ts in progress, the channel is freed immedi- 
ately but the tape continues to a record gap. 

Step 20 performs the read from tape, and if A,‘ = I, the partition read 
is transferred to p,, the seven-bit representation of the character read is 
transferred to X’ (both for use by the channel), and the channel-unit 
interlock #, is set to unity to initiate appropriate disposition of the 
character by the channel. If P;‘ is not a record gap reading continues, the 
intercharacter delay (not shown) permitting time for the channel to 
dispose of the character before the next is actually read. If P,’ is a record 
gap, the corresponding delay elapses before A,’ is tested. If Aj‘ = 0, 
the branch to step 4 stops the unit. The tape stops only at a record 
gap although transmission of data may be discontinued earlier by step 
21. 

The writing process begins at step 37 and may be discontinued before 
any writing occurs (although the current record gap will be lengthened by 
a few inches of blank tape). The main writing is performed by the loop 
28-32, employing the channel interlock ¥,. Step 3} sets the tape end limit 
indicator. The loop terminates (step 32) when the write record gap 
indicator g, 1s set to unity by the channel. Steps 33-36 then write the 
Jongitudinal parity check character Y‘ supplied by the channel, together 
with the inter-record gap partition A,. The write loop is then re-entered 
unless A,’ = 0. 


Channel operation. Operation of a channel is initiated either by one of the 
special functions (WEF, BSR, BSF, REW) already described, or by a 
WRS (write select), or an RDS (read select). The loading of the channel 
command C’ required to control the two latter functions is, however, 
controlled by a subsequent RCH (reset load channel), which transfers to 
S' the address in memory of the desired channel command. 

The WRS (Program 2.18) selects the channel / specified by a portion of 
its address, waits until the channel is free, sets its tape unit index ¢, as 
specified by another portion of the address, sets the write indicator w, to 
unity and the function indicator f,; to zero, and, finally, sets 6, to start the 
channel. The fetch mode indicator f is also set to one so as to skip the 
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Program 2.18 Select unit and load channel instructions 
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channel trap on the next instruction fetch. This prevents a trap from 
intervening between the WRS and the following instruction (which is 
normally an RCH). The RDS differs only in the setting of w, on step 4. 

If the channel is not busy (i.e., not selected), the RCH instruction 
(Program 2.18) selects the channel specified by a portion of the operation 
code, sets the input-output indicator 4, and copies the instruction address 
to the channel sequence vector S’. If the channel is busy, the RCH 
instruction sets the selected channel to its step 9, whereupon the channel 
waits on the interlock at step 10. Meanwhile, step 18 of the RCH sets 
S‘ and step 19 sets the interlock r, so that the channel may proceed. 

Steps 13 and 14 of the channel operation load the channel command 
register C’ and increment the channel sequence register S‘. If the command 
isa TCH (Transfer in Channel), step 16 causes a branch to a new sequence 
of commands. If not, the werd count, represented by (3 | a!)/C’, is 
tested. If itis zero and if the current command is either an LOSP or [OCP, 
the branch to step 13 immediately fetches the next command in sequence. 
Otherwise, indirect addressing of the command occurs (step 19) unless 
C},, is zero, 

Step 20 specifies k according to the class of the command being exccuted. 
The commands are listed in the matrix K of Table 2.13. 

The first component K,' assumes the value c, r, or s according as the 
command having code /is terminated by a word count test, a record gap, 
or by either (signal). The second component K,‘ assumes the value d, p, 
or t according as the channel discontinues operation, proceeds to the next 
command in sequence (as determined by S$"), or transfers to an LCH 
(Load Channel) instruction which may be awaiting execution by the 
computer proper. Execution of the LCH (Program 2.18) is delayed at 
step 1] and branches to step 18 (to respecify S’ in the manner of the RCH) 
only if the channel reaches step 3. 

Channel operation continues on the right-hand segment (steps 35-65) if 
the operation is a read (w, = 0), and on the left (steps 22-34) if it is a 
urite. In the latter ease, a zero word count causes immediate termination 
of the current command. 

The normal termination of a command in either read or write mode 
occasions a branch to step 1, where the tests for continuation begin. Step 
1 sets the Channel Command Trap indicator T.,' if the current command is 
of the transfer type and an LCH (Load Channel) is not awaiting execution 
in the computer proper. If the command is of the proceed type, step 2 
branches to step 13, where the next command in sequence is fetched. If the 
command is of the transfer type and an LCH is waiting (7; = 1), step 3 
branches to step 9 to reset parameters and permit the channel to be reloaded. 
In all other circumstances step 4 is executed to disassociate the unit from the 
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channel and to return the channel to idle status. In read status, certain 
abnormal events—the occurrence of a parity error, or an end of file 
partition—return the channel to idle status immediately, regardless of the 
type of command being executed. 

The write operation (steps 22-34) is relatively simple. If the word 
count in (3 | a)/C’ is zero, steps 23-24 terminate the current command 
but first initiate the writing of an end of record gap* if the command is of 
the ‘‘record”’ type (e.g., an LORP). If the word count is not zero, step 25 
transfers to the channel data register D’ the memory word selected by the 
address portion of the command. The loop 28-33 transfers to the tape 
unit successive six-bit infixes of D’ and maintains the longitudinal parity 
check Y‘ (originally reset on step 12). When all six have been transferred, 
the branch to step 34 decrements the word count and, unless it becomes 
zero, repeats the entire process from step 25. 

The read operation (steps 35-65) begins by resetting D’ to zero and the 
infix selector V‘ to a. Step 37 terminates the current command if it is of 
the count or signal type and the word count Is zero. Steps 38-39 terminate 
the command if it is of the record or signal type and if the last file partition 
read is a record gap. The partition indicator p, is reset to Ay by step 39. 
Thus a record gap present when termination is caused by a zero count is 
still present on the first execution of the succeeding command, whereas a 
gap which itself causes termination is not present on the succeeding 
command. 

Steps 40-43 show the data interlock, the determination of the longi- 
tudinal parity check, and the setting of the parity error trap T,’ in the 
event of a parity error in the character. If the corresponding channel trap 
is enabled, step 44 causes immediate termination in the event of a parity 
error. Steps 45-48 detect an end of file configuration (using a counter, 
Q,' ina manner similar to that used in Program 2.15), set the indicator 
Q,' if a partition character A, appears at the beginning of a word, and 
cause termination (from step 49) with the end of file trap Ty‘ set if an end 
of file configuration occurs. If the character A, occurring at the beginning 
of a word is not part of an end of file configuration, step 50 sets the tape 
error trap T,', and step 51 causes termination if the corresponding channel 
trap 1s enabled. 

Steps 53-56 are executed only if p; is a record gap. They reset the 
counters Q’ controlling the end of file test, test and reset the longitudinal 
parity vector Y’, and may cause termination in the event of an error. Step 
57 causes the character transfer of step 58 to be skipped if the character is 

* Since the partition A, is represented by a gap, the writing of one gap immediately 
following another, with no intervening data, has the effect (when subsequently read) of a 
single record gap. 
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— 1s = 241 + Ls) k—— 

Input-output test 10 JOT —+h : 0 — 
—,h+0 

Beginning of tape test 10 BTT —> k <0 — 
End of tape test 10 ETT —y k <I 

i< 19 fue — 1 ke 

L,*' : 0 — 


L,! <0 


Transfer on channel 11 TEF —>| i < - (1 | @/e — (8€) -G, 0) + cy 
end of file 
b<T,' 


T,' <0 


| 


Transfer on channel 11 TRC — i <— LC | al)/e — (Be) (2, 2) + cy 
redundancy 
b+, 


T,' —0 


in Operation 


Transfer on channel 11 TCN — k <1 
not in operation 


Transfer on channel 11 TCO —> k <0 f 


i« LU J attic — (8e)1(6, 0) 


b<—(6, &k) 

=i 7:1 | 
b:0 > 
s<abic |» 


M® <— w4(36)\s 
6:0 faze 


js<—1 eet 


Program 2.19 Input-output branch operations 
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a potential end of file. Steps 62-65 decrement the word count and transfer 
completed words to successive memory locations unless Cj, is zero. Step 
61 suspends these operations when the word count reaches zero. Since 
step 56 is followed by step 61, the occurrence of a record gap occasions the 
(potential) transfer of a word to memory even though it is incomplete. 
Because of the reset of D’ on step 35, the incompleted part of the word is 
ZeTO, 


Auxiliary channel instructions. Program 2.19 shows those branch instruc- 
tions which are controlled by indicators associated with the data channels. 
Each indicator tested is also reset. The last four instructions shown are 
subject to the trap mode. 


11 SCH 4 i< 2/0] ajc + cy 


¥ — €(36) 
— b,:0 


i: 


yo 
(3, iy — S? 


(Yis: Yeo) <0 
Mielec <y +> 
11 ENB y — Mi e%le 
E, «#8 /y 

E, —(10 | a8)/y 
E, — w/y 
E<E 


10 RCT e<«~] = 


Program 2.20 Trap control and store channel 


The channel indicators T may also cause interruptions as detailed in the 
instruction fetch phase. They are controlled by the enable matrix E and 
the enable trigger e which are set by the ENB (enable) and RCT (reset 
traps) instruction of Program 2.20. The instruction SCH (Program 2.20) 
permits storage of the channel registers. 
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2.3. DETAILED LOGICAL DESIGN 


Although a description couched at the programmer’s level specifies 
completely the functions of a computer, it requires considerable extension 
to provide a basis for the so-called /ogica/ design of circuits for realizing 
the computer. The extensions include: (1) the specification of sequence in 
the microprograms themselves; (2) further detailing of certain complex 
functions; (3) reduction of the number of operands (registers) required; 
and (4) economization in the underlying functions provided. The nature 
of these extensions will be indicated briefly. 

In principle, the problem of sequence control in the microprograms does 
not differ from the sequence control in computer programs. However, the 
function served by the sequence vector s (a base two representation of the 
address of the succeeding instruction) is frequently served instead by a ring 
or combination of rings. A ring is a logical vector r of weight one (that is, 
+/r = 1) capable of rotation (fr or |r) and of resetting to one of several 
initial positions p, (that is, r <e€?:.) 

Certain steps of a microprogram, which at the programmer's level may 
be considered as monolithic, must themselves be realized in circuitry as 
more detailed microprograms. The addition of two unsigned (positive) 
numbers represented in base two by the vectors « and y might, for example, 
be performed as in Program 2.21. The result » produced is correct only if 
the sum is less than 2°. 

Economization in the underlying functions provided is achieved by 
restricting the “data paths” provided between the various operands (1.e., 
registers) and by restricting the operands to which certain operations 
apply. Restriction of data paths implies that it is not possible for each 
operand to specify every other operand directly. For example, memory 
may be restricted to communicate only with the buffer register d so that 
any transfer from memory such as 


c< Mes 


Hl 


y : 0 
<—y 
yo tA 2) 


x <(x* % &) 


Program 2.21 Base two addition 
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Figure 2.22 Data paths 


must in fact be performed in two steps: 
d<—M-s 
c<d, 


An operation such as address decoding (i.e., conversion of the normal 
base two representation of an address / into a one-out-of-1 code of the 
form e€’ suitable for selecting word i from memory) is relatively costly and 
is not normally provided for all relevant operands (such as s and w!5/¢ in 
the 7090). Instead, decoding may be provided on a single auxiliary 
operand a; the selection of an instruction in the 7090 would then be 
executed in two steps: 

a<s 


c< M4 


All microprograms specified at the programmer’s level must, of course, 
be translated into equivalent microprograms which satisfy the path 
constraints. Path restrictions are perhaps best displayed as a “block 
diagram” showing the data paths provided between the various registers 
and operations units. Figure 2.22 illustrates a convenient representation in 
which the paths are shown as connecting Jines with arrowheads indicating 
the possible directions of transfer. If the indicated paths are further re- 
stricted to selected components of the operands, this restriction may be 
indicated by a pair of selection vectors separated by a colon. Thus the 
notation 


o)!5(36) : €(15) 


on the path between d and s of Fig. 2.22 indicates that transfers occur 
between w!)/d and s. The symbols r and ¢ denote the selection of a matrix 
row and a matrix column, respectively, as illustrated by the path between 
M and d. Permutations may be represented in the form pj. Thus if the 
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vector d were to be transposed (reversed in order) in the transfer to c, the 
path would be labeled with the expression 


Pj se, 
where p = |t°(36)| = (35, 34,..., 1, 0). 
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EXERCISES 


2.1 Write 7090 programs for each of the following operations: 

(a) M8 < M8 y M? 

(b) M8 «- M8 

{M6 <-2861( | M8 + i M?) 

(Mt (M8 + | MY): 25) 

(d) M8 «. /M7, a8, M?/ 

(e) M® «- /M7, w°%, M/ 

(f) M6 «- (M7, (8 | «)°), M9/ 

(g) M®< /M", M8, M%/ (Use |x, u,¥/ = (x AH) y (¥ A UD] 

(h) M8 «-~ tw! 
2.2 In the magnetic core technology employed in the 7090, logical disjunction 
(or) and negation are much easier to produce than conjunction (and). Limiting 
the logical functions employed to disjunction and negation, write microprograms 
for the following 7090 instructions: 

(a) ANS (Use De Morgan’s law, Sec. 1.8) 

(b) ERA 
2.3. In the magnetic core technology used in the 7090, each transfer ofa quantity 
¥ into a register x is actually an or with the present content of the register, i.e., 
x<«-y y x. A register may also be reset to zero. Subject to the foregoing 
restriction, write microprograms for 

(a) the operation I <— M‘, (Use two steps.) 

(b) the operations of Exercise 2.2(a). 
2.4 Describe the main portion of the instruction fetch of the 7090 (steps 9--18 
of Program 2.4) in an algorithm which satisfies the data path constraints of Fig. 
222: 


2.5 Repeat Exercise 2.4 so as to satisfy the constraints of Exercises 2.2 and 2.3 
as well. 


2.6 A vector p which permits only the following types of operation: 
(i) D <-\p,o', 
\: foe 1 
(re 
|p <- lip 
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is called a pushdown or stack vector. If the successive components of a stack 
vector p are represented by successive memory registers, then the operations 
affecting it can be controlled by a single address counter v, which is automatically 
incremented at each operation of type (i) (addition of a new final component) and 
is automatically decremented at each operation of type (ii) (reference to the final 
component accompanied by its deletion). 

(a) Using the 7090 registers and formats augmented by an address counter 0 
of dimension 15, write a microprogram for an operation LDS (load stack) 
which transfers M- @'°/¢ to the top of the stack [operation type (i)]. 

(b) Write a microprogram for STS (store stack) which transfers the top of the 
stack to M_-#'*/¢ [operation type (ii)]. 

(c) Write a microprogram for an operation AND which produces the and 
function of the top two components of the stack, deletes them, and appends 
the result as a new final component. [The net reduction in »(p) is one]. 

(d) The AND of part (c) has no associated address. Show that all 7090 
instructions (other than input-output) can be redefined so that only the 
LDS and STS require associated addresses. 

2.7 In the 7090, a decimal digit « is represented in direct binary coding in a 
six-bit logical vector x, (that is, | » = .«), and each register accommodates six 
decimal digits. Use the convert instructions (Program 2.12) in a 7090 program to 

(a) convert from binary to decimal. 

(b) convert from decimal to binary. 

(c) replace all leading zeros (i.e., all preceding the first significant digit) of a 
number represented in decimal. 

2.8 Write a 7090 program to convert 

(a) froma 36-bit binary code to a 36-bit reflected Gray code [see Phister (1958)]. 

(b) froma reflected Gray code to binary. 

2.9 A memory M is called a tag or associative memory if for any argument x 
it yields a direct indication of the row or rows of M which agree with x, If the 
resulting indication is in the form of a vector s such that the matrix s//M contains 
the indicated rows, then s = M “ x. More generally, a logical mask vector m is 
added to the system so that m/M is compared with the argument m/x and some 
desired function of 7m/M” is represented by m/M® for each k. In the following 
exercises M is assumed to be a logical matrix. 

(a) Use De Morgan’s laws (Secs. 1.8 and 1.11 or Sec. 7.1) to derive from the 
relation s = M *. x an expression for s which would be suited to a circuit 
technology in which disjunction and negation are easier to perform than 
conjunction. 

(b) Write a detailed algorithm using a row-by-row scan of M to determine 
s = (m/M) “* (mx). 

(c) Repeat part (b) using a column-by-column scan of M. 

(d) Use a column-by-column scan of M todetermine s such that s//M contains 
the rows of M of maximum base two value [see Falkoff (1962)]. 


chapter 3 


REPRESENTATION OF 
VARIABLES 


3.1 ALLOCATION AND ENCODING 


Although the abstract description of a program may be presented in any 
suitable language, its automatic execution must be performed on some 
specified representation of the relevant operands. The specification of this 
representation presents two distinct aspects—allocation and encoding. 

An allocation specifies the correspondences between physical devices and 
the variables represented thereby. An encoding specifies the correspond- 
ences between the distinct states of the physical devices and the literals 
which they represent. If, for example, certain numerical data are to be 
represented by a set of 50 two-state devices, the two-out-of-five coding 
system of Exercise 1.6 might be chosen, and it would then remain to 
specify the allocation. The two-digit quantity “hours worked” might be 
allocated as follows: devices 31-35 represent components {—5, respectively, 
of the first digit, and devices 29, 16, 17, 24, and 47 represent components 
1, 2, 3, 4, 5, respectively, of the second digit. 

The encoding of a variable will be specified by an encoding matrix C and 
associated format vector f such that the rows of f/C list the representands 
and the rows of f/C list the corresponding representations. The encoding 
is normally fixed and normally concerns the programmer only in the 
translation of input or output data. Even this translation is usually 
handled in a routine manner, and attention will therefore be restricted 
primarily to the problem of allocation. 

However, the encoding of numeric quantities warrants special comment. 
It includes the representation of the sign and of the scale, as well as the 
representation of the significant digits. Small numbers, such as indices, ad- 
mit not only of the usual positional representation but also of the use of the 
unit vector €’ to represent the number /(i.e., a one-out-of-n coding system), 
or of the use of a logical vector of weight / (i.e., a base | number system). 

Allocation will be described in terms of the physical vector m, which 
denotes the physical storage elements of the computer. Each component 
of m corresponds to one of the »(7) similar physical devices available, its 
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range of values is the set of physical states achtevable by each device, and 
its index is the address of the device. Each component of m may corre- 
spond to a computer register, an individual character position in a register, 
or an individual binary digit within a character, depending on the degree of 
resolution appropriate to the allocation problem considered. The 0-origin 
indexing normally used for computer addresses will be used for the physical 
vector, but l-origin indexing will, throughout this chapter, normally be 
employed for all other structured operands. 

An index of the physical vector will be called an address and will itself 
be represented in the (perhaps mixed) radix appropriate to the given com- 
puter. The Univac, for example, employs base ten addressing for the 
registers, and (because of the use of 12-character words) a radix of twelve 
for finer resolution. The address of the fourth character of register 675 
might therefore be written as 675.3. In computers which have two or 
more independent addressing systems (e.g., the independent addressing 
systems for main memory and for auxiliary storage in the IBM 705), 
superscripts may be used to identify the several physical vectors 7’. 

In general, the representation of a quantity x is a vector (to be denoted 
by e(x)) whose components are chosen from the physical vector m. Thus 
e(.c) = kix, where k is a mapping vector associated with x. The dimension 
of the representation (that is, »(p(~))) is called the dimension of x in nm. If, 
for example, p(r) = (myo, My, M7, My), then k = (10, 9, 17, 18), and the 
dimension of « in mis four. If e(7) is an infix of x, then the representation 
of x is said to be solid. A solid representation can be characterized by two 
parameters, its dimension ¢ and its /eading address f, that is, the index in 
of its first component. Then p(x) = (f'| «”)/7. 


3.2 REPRESENTATION OF STRUCTURED 
OPERANDS 


The grid matrix 


If each component of a vector * has a solid representation, then the 
representation of the entirc vector is said to be solid and may be charac- 
terized by the grid matrix T(x), of dimension x(x) x 2, defined as follows: 
I’, ‘(«) is the leading address of e(x;), and T,’(«) is the dimension of #, in 7. 
If, for example, the vector * is represented as shown in Fig. 3.la, then 


17 2 
19 4 
T(x) =] 27 5 
23 1 
32 3 


| X4 | Xs Xs | Representand 
22 | M23 | M24] M25) M26 | Taz} Mag | Tag | Tso | Mar | M32] M33 | M34 Physical vector 
(a) Representation of a vector * 
T,? re r3 T,! rT, r5 r,5 Representand 
T45 | Tas | Taz | Mag | M9 | M507 M51) M52] M53 | M54 | M55 | M56 | M57 | M58 ra Physical 
| vector 
2 9; 90),4)2)7 7/0) 5) 2 | 3 | 0 | 1 3 | 21 04} 3 Actual 
i | i 1 | i} ' 
value 


(b) Linear representation of I'(*) with / = 34 + 47 + 2). 
Figure 3.1 
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Figure 3.2 Linear representation of a matrix X 
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Any structured operand can first be reduced to an equivalent vector, and 
the grid matrix therefore suffices for describing the representation of any 
construct, providing only that the representation of each of its elements is 
solid. Thus a matrix X may be represented by either the row-by-row list 
r = E/X or the column-by-column list ¢ = E//X, and a tree T may be 
represented by the left list matrix [T or the right list matrix ]¥, either of 
which may be represented, in turn, by a vector. 

Ifa process involves only a small number of variables, it is practical to 
make their allocation implicit in the algorithm, Le., to incorporate in the 
algorithm the selection operations on the vector m necessary to extract the 
appropriate variables. This is the procedure usually employed, for 
example. in simple computer programs. In processes involving numerous 
variables, implicit allocation may become too cumbersome and confusing, 
and more systematic procedures are needed. 


Linear representations 


The representation of a structured operand is said to be /inear if each 
component is represented by an infix of the form (/ | &”)/m, where /is a 
linear function of the indices of the component. For example, the 
representation of the matrix X indicated by Fig. 3.2 is linear, with d = 2 
and /= —Il + Si + 8). 

A linear representation is solid and can clearly be characterized by a 
small number of parameters—the dimension d of each component and the 
coefficients in the linear expression /. The representation of a vector ™ is 
linear if and only if T,(¥) = de and the difference 6 = T(x) — TI '(¥) is 
constant for 7 = 2,3,..., »(%). 

If/=p-+qict ris the function defining a linear representation of a 
matrix X and if @ is the leading address of a given element, then the leading 
address of the succeeding element in the row (or column) is simply @ + r 
(or a+ q). Frequently, the succession must be cyclic, and the resulting 
sum must be reduced modulo »(X) x r (or w(X) xg). The inherent 
convenience of linear representations is further enhanced by index registers, 
which provide efficient incrementation and comparison of addresses. 

Linear representation of a structured operand requires that all com- 
ponents be of the same dimension in m. This common dimension may, 
however, be achieved by appending null elements to the shorter com- 
ponents. The convenience of the linear representation must then be 
weighed against the waste occasioned by the null elements. Moreover, if 
several vectors or matrices are to be represented and if cach is of unspecified 
total dimension in 7, it may be impossible to allot to each an infix suffi- 
ciently large to permit linear representation. Consequently, a linear 
representation is not always practicable. 
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Nonlinear representations 


Since the use of the grid matrix imposes only the condition of solidity 
for each component, it permits an allocation which is sufficiently general 
for most purposes. The grid matrix serves in two distinct capacities: (1) 
as a useful conceptual device for describing an allocation even when the 
actual allocation is implicit in the program, and (2) as a parameter 
which enters directly into an algorithm and explicitly specifies the allo- 
cation. 

If the grid matrix is used in a program as an explicit specification of the 
allocation, then the grid matrix must itself be represented by the physical 
vector. There remains, therefore, the problem of choosing a suitable 
allocation for the grid matrix itself; a linear allocation is illustrated by 
Fig. 3.10. 

If the grid matrix I(x) itself employs a linear representation, its use 
offers advantages over the direct use of a linear representation of » only if 
the total dimension of Tin x is much less than the total dimension of » in 
7m when linear representations are employed for both. This is frequently 
the case, since each element of a grid matrix belongs to the index set of x 
(that is, to u(r(7t))), and the dimension of each element in x is therefore 
both uniform and relatively small. Program 3.3 shows the use of the grid 
matrix P'(«#) and the encoding matrix C in determining the Ath component 
of the vector x, 


Program 3.3. A linear representation is assumed for T(x), with element T,‘(*) 
represented by the infix (p + gi + rj) | @)/m. Moreover, each element of T(x) 
is assumed to be represented in a base 6 number system. Step 1 determines the 
leading address of the representation of T(x). Step 2 specifies fas the base 6 
value of this representation, i.e., as the leading address of p(x,,). Steps 3 and 4 
specify d as the dimension of x, in m, and step 5 therefore specifies z as the 
representation of x,,. 

Steps 7-9 perform the decoding of z = p(x%,) to obtain z as the actual value of 
x,. Since this process is normally performed by human or mechanical means 
(e.g., a printer) outside the purview of the programmer, it is here expressed directly 
in terms of the encoding matrix C rather than in terms of its representation. The 
left-pointing exit on step 7 is followed only if z does not occur as an entry in the 
encoding matrix. 


The form chosen for the grid matrix is one of several possible. The two 
columns could, for example, represent the leading and final addresses of 
the corresponding representations or the dimensions and final addresses. 
The present choice of leading address f and dimension d is, however, the 
most convenient for use in conjunction with the notation adopted for 
infixes; the logical vector (f'| a”) selects the appropriate infix. 
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0-origin indexing for 7 only 
1—4 l<—ptgqkir xl p,q.) Constant, coefficient of row 
> ba index, and coefficient of column 
fo OME &)/n) index in the linear function for 
3 lelar the representation of T(x). 
4 d—bL((l| aim) b | Base used in representing ele- 
ments of T(x). 
5 z<(fy atin g | Dimension in x of each element 
6 h--w(C) +1 of T(x). 
7 =lheh—1 jf | Leading address of e(x,.). 
d | Dimension of e(*,) in ™. 
8 zo: f/Cr # . 
: z | e(*;). 
9 z< fiCh > C | Encoding matrix for com- 
ponents of x. 
f | Format vector for C. 
* | Character encoded by »,,. 
Legend 
Program 3.3. Determination of z = e(x,) and z = x, from a linear representa- 


tion of the grid matrix T(x) 


Chained representations * 


If a linear representation is used for a vector, then the deletion of a 
component (as in a compress operation) necessitates the moving (ic., 
respecification) of the representations of each of the subsequent com- 
ponents. Similarly, mesh operations (insertion) and permutations 
necessitate extensive respecification. The use of a grid matrix T(x) 
obviates such respecification in ¥, since appropriate changes can instead be 
made in I(x), where they may be much simpler to effect. If, for example, 
x is the vector represented as in Fig. 3.la@, and z is a quantity of dimension 
six in 7, then the mesh operation 


\ 


x <— ix, 2) 
may be effected by specifying the physical infix (70 | «5)/x by p(z) and by 


* Chained representations have received extensive treatment, frequently under the 
name “lists.” See, for example, Shaw et al. (1958) and Blaauw (1959). 
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respecifying I(x) as follows: 
17 
19 


~~ 
= 
WO RP umn & 


However, if the representation of I(x) is itself linear, then insertions, 
deletions, and permutations in » will occasion changes in all components of 
I(x) whose indices are affected. The need for a linear representation of the 
grid matrix (and hence for all linear representations) can be obviated by 
the use of a chained representation defined as follows. 

Consider a vector y, each of whose components y, has a solid represen- 
tation e(y,) whose infixes (g | @”)/e(y,) and a*/e(y,) are, respectively, 
the dimension of e(y,) in m and the leading address of the representation of 
the (cyclically) succeeding component of y (both in a base 6 system), and 
whose suffix «*”/e(y,) is the representation of the Ath component of some 
vector *. Then (the representation of) y is called a chained representation 
of x. In other words, the representation of y incorporates its own grid 
matrix (with the address column I\(y) rotated upward by one place) as 
well as the representation of the vector ». 

For example, if g = 2, b = 10, and * = (365, 7, 24), then 


O(¥1) = (Mz, Ty, Tig, M9, My, M2, Tz) = (6, 8, 0, 7, 3, 6, 5), 


(V2) = (Tyg, Megs Mrs Mri, Mr) = (2, 6, 0, 5, 7), 
and 
PCV) = (Mog, M7, Tyg, Mag, May, Msi) = C1, 7, 0, 6, 2, 4), 


is a suitable chained representation of x. 

The parameters required in executing an algorithm on a chained 
representation y are g, the common dimension in x of the elements of the 
grid matrix I(y); 6, the base of the number system employed in their 
representation; and fand A, the leading address and index, respectively, 
of the representation of some one component of y. The parameters g and 
b are usually common to the entire set of chained representations in use. 
Program 3.4 illustrates the type of algorithm required to determine e(*,.) 
from a given chained representation of x. 


Program 3.4. The loop (1-3) is executed r(x) lo (A — fh) times, with the result 
that at step 4 the parameter fis the leading address of e(¥,,). Step 4 therefore 
specifies das the dimension of p(y,,), that is, as the base 6 value of T,/(y). Step 5 
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0-origin indexing for 7 only 


1 h <- r(x) |, (A + 1) h, f | fis the leading address of the 
Ath component of the chained 
<—bL( fl a 
2 ae (ft ain) 4: representation of x, 
3-H Ask b | Base used for representation 
4 dbl f tg) | ain of the elements of the grid 
: ‘ matrix. 
5 z<-(f{ atin g | Dimension in x of elements of 
6 e(x,) <— B29/z |__ the grid matrix. 
d | Dimension in x of kth com- 


ponent of the chained represen- 
tation of x. 


% | kth component of the chained 
representation of *. 


Legend 


Program 3.4 Determination of pe(x,) from a chained representation of x 


then specifies z as e(y,,). Step 6 deletes those components of z which represent 
the elements of the grid matrix, leaving e(*,.). 

The parameters f and /i are themselves respecified in the execution of the 
algorithm so that # becomes & and f becomes, appropriately, the leading address 
of e(¥,.). A subsequent execution then begins from this new initial condition. 


The chained representation used thus far is cyclic and contains no 
internal identification of the first or the last components. Such an identi- 
fication can be incorporated by adding a null component between the last 
and first components of x. Alternatively the identification may be achieved 
without augmenting the dimension but by sacrificing the end-around 
chaining, 1.e., by replacing the last component of TP\(y) by a null element. 
Moreover, a chained representation may be entered (i.e., the scan may be 
begun) at any one of several points, provided only that the index # and 
corresponding leading address fare known for each of the points. 

The number of components of a chained representation scanned (steps 
1-3 of Program 3.4) in selecting the Ath component of * is given by 
y(*) |, (kK — fh), where fr is the index of the component last selected. The 
selection operation is therefore most efficient when the components are 
selected in ascending order on the index. The chaining ts effective in the 
forward direction only, and the component (4 — 1) would be obtained 
only by a complete cyclic forward scan of (x) — | eomponents. The 
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representation is therefore called a forward chain. A backward chain can 
be formed by incorporating the vector |T,(y) instead of TF(y), and a 
double chain results from incorporating both. 

A vector x which is respecified only by either deleting the final com- 
ponent or by adding a new final component (i.e., by operations of the 
form * < @1/x, or x <— * @ (z)) behaves as a stack (cf. Exercise 2.6). A 
backward-chained representation is clearly convenient for such a stack. 

A simple example of the use of a chained stack occurs in representing 
the available (1.e., unused) segments of the physical vector 7. This will be 
illustrated by a program for the vector compression 


x < v/x 


executed on a forward-chained representation of x. The unused segments 
representing the components of v/* are returned to a backward-chained 
stack or pool of available components. A linear representation can usually 
be used for logical control vectors such as v; in any case the problems 
involved in their representation are relatively trivial and will be subordi- 
nated by expressing each operation directly in terms of the logical vectors 
and not in terms of the physical components representing them. 


Program 3.5. In the major loop (6-23), & determines the index of the current 
component v,, and / and / determine the leading addresses of e(x,) and e(%;. 1), 
respectively. These three parameters are cycled through successive values by 
steps 7, 8, and 12 and are initialized by steps 2, 5, and 12. If v, = 0, the infix 
p(*,,) is returned to the pool by steps 21, 22, 23, and 6 so as to construct a back- 
ward chain. 

The parameter .v specifies the leading address of p(,) unless #(¥) = 0, in which 
case « is null. Step | terminates the process if »(*) = 0, and otherwise step 4 
respecifies as the null element. Ife = 0, this null value of » remains; if not, the 
first nonzero component of v causes a branch to step 14. Since « = 9, step 15 is 
executed to respecify w as the leading address of p((v/*),). Step 16 then specifies /, 
the leading address of the last completed component of v/x. Step 15 is never 
again executed. 

Components of vjx other than the first must each be chained (in a forward 
chain) to the preceding one. Hence the leading address / of a newly added com- 
ponent must be inserted in the last preceding component (whose leading address 
is). This is normally done by steps 18, 19, and 6; step 20 respecifies 4. If, how- 
ever, the component *,.., were also included, it would appear as the fast com- 
pleted component of v/x and would already be chained to the new component *,. 
This situation is recognized by step 17 and occasions a branch to step 16. Step 16 
then respecifies # and repeats the loop without executing steps 18, 19, and 6. 

The process terminates when the cycle through the chained representation of x 
is completed, that is, when / returns to the original value of ., preserved as 1 by 
step 3. Step 10 is then executed, terminating the process directly if r(v/x) = 0. 
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Otherwise, step 11 is executed to close the chain of v/x, that is, to insert , the 
leading address of e((v/*),), in the representation of the last component of v/x. 


0-origin indexing for 7 only 


1 — > « ° 
f non « ) Leading address of e(%,) 
3 (<2 if +) > 0; otherwise 


“me 


via 0° 


v | Logical vector, 


k<] k | Index of v. 


i | Leading address of e(%,). 


4 
5 
6 7] Or | aim) <—s 
7 pede sy Leading address of e(%,., 4). 
Net A Leading address of last 
8 arate | preceding component of 
9 iit ules 
P | Leading address of last 
10 eee anaes 
preceding component of 
11 b L(A @)/n) <x pool of available  seg- 
; mses ments, 
12 jb. { @)/n) ; chs. oA : 
: & | Dimension in w of cle- 
13 vy, 2 0 ments of grid matrices. 
14 ia vite b | Base of representation of 
elements of grid matrices. 
15 V<i 
16 hA<-i Legend 
17 % 1 2 0 
18 r<h 
19 S<—l 
i eer 
21 r<—l Ee 
22 sp 
23 -—4 p<i 


Program 3.5 Program for x <-v/* on a forward chained representation of x 
and a backward chained stack of available segments 


A chained representation can be generalized to allow the direct represen- 
tation of more complex constructs, such as trees, by incorporating the 
address of each of the successor components associated with a given 
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component. This notion is formalized in the chain list matrix of Sec. 3.4. 
The same scheme can also be employed to produce an efficient combined 
representation of two or more vectors which share certain common 
components. If, for example, x, = %,, and chained representations are 
used for both x and z, then * may be represented in standard form except 
that component »%, incorporates a secondary address, which is the leading 
address of %,.,. Moreover % has a standard representation except that 
%, , is chained to *,, with an indicator to show that the secondary address 
of the succeeding component is to be used. Deletion of any vector 
component in such a shared system must occasion only the corresponding 
change in the address chain of the vector, the actual representation of the 
component being deleted only when no associated address remains. 


Partitions 


If the set a is the range of the components of the physical vector m, and 
if some element, say 4,, is reserved as a partition symbol and is exeluded 
from use in the normal representation of quantities, it can be inserted to 
demark the end (or beginning) of an infix of a. If the veetor y is repre- 
sented by a single infix of m such that the beginning of component ¥,., 
follows immediately after the terminal partition of y,, then the structure of 
y is completely represented by the partitions, and y is called a partitioned 
representation. A partitioned representation can be used for more complex 
operands, such as matrices, if a set of two or more distinct partition 
symbols are provided, one for each level of structure. The distinct 
partition symbols can, of course, be represented by multiple occurrences of 
a single symbol a, rather than by distinct members of a. 

A partitioned representation is similar to a double-chained representa- 
tion without end-around chaining in the following particular: beginning 
from component y;, the component y, can be reached only by scanning 
all intervening components between / and / in increasing or decreasing 
order according as i<j or / > j. The file notation introduced in Sec. 
1.22 clearly provides the operations appropriate to a partitioned repre- 
sentation of a vector, with conventions which suppress all inessential 
references to the partitions themselves. 

The use of a partition to demark the end of an infix is particularly 
convenient when the infix must be processed component by component 
for other reasons, as in the use of magnetic tape or other serial storage. 
The partition also appears to be more economical than the grid matrix, 
which it replaces. This apparent economy is, however, somewhat illusory, 
since the reservation of a special partition symbol reduces the information 
content of each nonpartition component by the factor log, (a) — 1) + 
log, »(a), where a is the range of the components of 7. 
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Partitions can be employed in chained representations. For example, 
the dimension in x of each component of a chained representation y can be 
specified implicitly by terminal partitions instead of explicitly by the vector 
I,(y) of the grid matrix. Thus if the elements of I,(y) are of dimension g 
in 7m, then o!/e(y,;) = a), and (@ A @')/e(y,) = e(x;), where x is the 
vector represented by y. Program 3.6 shows the determination of p(*,) 
from a chained representation y with terminal partitions a,. 


Q-origin indexing for m only 

1 h«-w(x)|,(4 +1) k 

. h,f | f is the leading address of the 

<_ | t g . ” 
2 fab (fy ein) ; hth component of the chained 
3 —oHheik # representation of x. 

. . b Base used for representation of 
4, <—f t+ 
a i-ft8 _ the elements of the grid matrix. 
4b 4 ay g | Dimension in x of the elements 
4c. jajtl of the grid matrix. 
4d d-j—f a, | Partition symbol. 

; z Ath component of the chained 
i} a . 
5 za (fy a)/r representation of x exclusive of 
6 e(x,) <- &t/z the terminal partition symbol. 
d Dimension of z in 7. 


Legend 


Program 3.6 Determination of e(x,) from a chained representation of « with 
terminal partitions a, 


Program 3.6. The program is similar to Program 3.4 and the step numbering 
indicates the correspondences. The dimension d is so determined (steps 4a—d) as 
to exclude the terminal partition itself from the quantity z specified by step 5. 
Since only the first column of the grid matrix is incorporated in the partitioned 
representation, step 6 excises a prefix of dimension g rather than 2g as in 
Program 3.4. 


Pools 


Components of the physical vector z in use for the representation of one 
quantity must not be allocated to the representation of some other quantity. 
The construction of a chained representation therefore poses one problem 
not encountered in its use, namely, the specification and observation of 
restrictions on the availability of components of x. The restrictions can 
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conveniently be specified as a poo/, consisting of the available components 
of m. Each allocation made must then be reflected in a corresponding 
change in the pool. Moreover, as each piece of data is deleted, the com- 
ponents allocated to it are returned to the pool. 

If, as in Program 3.5, a pool is treated as a stack, then the component 
next taken from the pool is the component last added to it. The queue of 
components in the pool thus obeys a so-called fast in first out, or LLFO 
discipline. The dimension in zm of the last component of a pool will not, in 
general, agree with the dimension required for the next quantity it is called 
on to represent. If it exceeds the requirements, the extra segment may be 
left in the pool, and the pool therefore tends to accrue more and more 
components of smaller and smaller dimension. Hence it may be wise, or 
even essential, to revise the pool occasionally so as to coalesce the segments 
into the smallest possible number of infixes. This process can even be 
extended to allow substitutions in other vectors in order to return to the 
pool short segments which may unite existing segments of the pool. This, 
however, will require a systematic scan of the chained vectors. 

If the dimension of the last component (or perhaps of all components) 
of the pool falls short of the requirements for representing a new quantity, 
segments of the pool can be chained together. This requires the use of a 
special partition symbol or other indication to distinguish two types of 
links, one which marks the end of a given representation and one which 
does not. More generally, it may be convenient to use multilevel partition 
symbols to distinguish several levels of links, as was suggested for the 
representation of a matrix. 

Queue disciplines other than LIFO may be used. Three other types of 
primary interest in allocation queues are the FIFO (first in first out), the 
dimension-ordered, and the address-ordered disciplines. FIFO uses a 
forward chain and may be preferred over LIFO because it uses the entire 
original pool before using any returned (and usually shorter) segments. 

The components of a dimension-ordered pool are maintained in 
ascending (or descending) order on their dimensions in m. This arrange- 
ment is convenient in selecting a pool element according to the dimension 
required. The components of an address-ordered pool are arranged in. 
ascending order on their leading addresses. This arrangement facilitates 
the fusion of components which together form an infix of 7. 

If each of the available components of 7 is set to a special value which 
is used for no other purpose, then the available components can be 
determined by a scan of m. Such a pool has no structure imposcd by 
chaining and will be called a marked pool. 

A marked pool requires little maintenance, since components returned 
to it are simply marked, but selection from it requires a scan of m and is 
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therefore relatively slow. The use of marked and chained pools may also 
be combined—all returned components go to a marked pool which is left 
undisturbed until the chained pool is exhausted, at which time the entire 
marked pool is organized into a chained pool. 


Summary 


Since any structured operand can first be reduced to an equivalent 
vector, the problems of representation can be discussed in terms of vectors 
alone. The characteristics of the linear, chained, and partitioned repre- 
sentations of a vector may be summarized as follows. A linear representa- 
tion permits the address of any component to be computed directly as a 
linear function of its indices and hence requires no scanning of the vector. 
However, the strict limitations which it imposes on allocation may en- 
gender: (1) conflicts with allocations for other operands, (2) waste of storage 
duc to the imposition of a common dimension in x for all components, or 
(3) uneconomical execution due to the extensive reallocations occasioned 
by the insertion or deletion of other than terminal components. 

The concept of the grid matrix is helpful even when the corresponding 
allocation is implicit in the program. The explicit use of a grid matrix 
which is itself in a linear representation removes the restrictions on the 
allocation of the vector itself while retaining the advantage of direct 
address computation. The address computation differs from the linear 
case only in the addition of a single reference to the grid matrix and hence 
requires no scanning. The difficulties enumerated for the direct linear 
representation are not eliminated but merely shifted to the linearly 
represented grid matrix itself, where they may, however, prove much less 
serious. 

A chained representation allows virtually arbitrary allocation, relatively 
simple operations for the insertion and deletion of components, the direct 
representation of more complex structures such as trees, and economical 
joint representations of vectors which have one or more components in 
common. However, a chained representation requires extra storage for 
the grid matrix which it incorporates and occasions additional operations 
for scanning when the components are selected in other than serial order. 
The required scanning can be reduced by the retention of auxiliary 
information which allows the chained representation to be entered at 
several points. 

A partitioned representation requires the allocation of a single infix of 
7m, and selection requires a fine scan, i.€., a component-by-component scan 
of m to detect partition symbols. Partitioning removes the need to 
incorporate the grid matrix explicitly and does not impose a common 
dimension in x for all components, 


§3.3 Representation of matrices 119 


Mixed systems employing combinations of linear, chained, and parti- 
tioned representations are frequently advantageous. Block chaining, for 
example, involves the chaining of blocks, each consisting of an infix of 7 
and each serving as a linear representation of some infix of the represented 
vector. Alternatively, each chained block may be a partitioned represen- 
tation of some infix. 


3.3 REPRESENTATION OF MATRICES 


Structured operands other than vectors may be represented by first 
reducing them to equivalent vectors which can, by employing the tech- 
niques of the preceding section, be represented, in turn, in the physical 
vector 7. In the case of a matrix A, two alternative reductions are of 
interest, the row list r= E/A = ALG AP C)+++ GA” and the column 
list c = E//A. If r,, A,’, and c¢, are corresponding elements of the three 


alternative representations, then in a O-origin system: 


h=wvi t+ j, 
k= it wy. 
Consequently, 
i=|h=o]=pelyk 
and f=rla= lk = pl. 


The dependence of # on & can be obtained directly by substituting the 
foregoing expressions in the identity 


h=vx |h+optola 
to yield h=v x (ulok) + [k + eI. 
Similarly, k= X (vk) + [h = 2]. 


The permutation A which carries the row list r into the column list c 
(that is, c = Ayr) can be obtained directly from the foregoing expression 
for has follows: 

h=v x (wel, 0°) + [vo = wel. 


The expression for the Ath component of A is identical with the expression 
for h above. Hence, if ¢ = Afr, then c, = ry, = 1, as required. 

If the row list (or column list) is itself represented linearly, then the 
address of any component A,’ is obtained as a linear function of the indices 
fand/. Ifeither a file or a chained representation is to be used for the list 
vector, then the components are processed most efficiently in serial order, 
and the use of column list or row list is dictated by the particular processes 
to be effected. 
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If a large proportion of the elements of a matrix are null elements, it is 
called a sparse matrix. Sparse matrices occur frequently in numerical 
work (where zero serves as the null element), particularly in the treatment 
of partial difference equations. A sparse matrix A can be represented 
compactly by the row list r = U/A, and the logical matrix U, where 
U=(A 70). The matrix A may then be obtained by expansion: 
A= U'r, 

Alternatively, the column list ¢ = (A #0)//A may be used. The 
transformation between the column list c and row list r must, in general, 
be performed as a sequential operation on the elements of U. Since it is 
frequently necessary to scan a given matrix in both row and column order 
(e.g., as either pre- or post-multiplier in a matrix multiplication), neither 
the row list nor the column list alone is satisfactory. A chaining system 
can, however, be devised to provide both row and column scanning. 

Let L be a matrix such that L, is a list of the nonzero elements of a 
matrix A in arbitrary order, L,' is the column index in A of element Ly’, 
and L,' is the row index in L of the next nonzero element following Ly’ in 
its row of A. If L,‘ is the last nonzero element in its row, L,' = °. Let f; 
be the row index in L of the first nonzero element of row A’, and let 
fF, =° if A? =0. The following example shows corresponding values of 
A, L, and f: 


go 7 
6009 5 3° 3 

0300 6 1 5 
A={]00 0 0] L=]3 2 °| f=]- 
7804 9 4° 6 
005 0 Pd 2 

4 4° 


The matrix E will be called a row-chained representation of A and may be 
used, together with the vector f, for the efficient scanning of any row A‘ as 
illustrated by Program 3.7. The vector L3 can be modified so as to give the 
address in 7 directly rather than the row index in L of the next element in 
the row, and Program 3.7 can then be easily re-expressed in terms of the 
physical vector 7. 


Program 3.7. Step 2 yields the index in L of the first element of the ‘th row of A. 
Step 4 determines its column index /, and step 6 determines the index of the 
succeeding component. The process terminates at step 3 when the scan of the 
row is completed. 

If L, is chosen as a row list, the vector L, reduces to the form L," =k + 1 
or EL," = ©. Its function can then be served instead by incrementation of 
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1 Ai <0 1-origin indexing | 
2 k<f; . . 
Sf; Row index in L of first nonzero 
3 kie [Fy element of row A‘. f; = ° if A? = 0, 
4 jobs k Row index in LE of next element. 
5 At «Lb L, | List of nonzero elements of A. 
j 1 . . , 
L,* | Column index in A of E,*. 
6 k<L,* . . 
L,* | Row index in LE of next nonzero 
element following L,* in its row in 


A, L," =o if no such element 
exists. 


Legend 


Program 3.7 Determination of the row vector A’ from a row-chained represen- 
tation of A 


the index & and by the use of the logical vector u = (L, = °e) for deter- 
mining the end of each row. 

The construction of a column-chained representation is analogous to 
that of a row-chained representation, and the two representations can be 
combined in a single matrix L which gives both row and column chaining 
employing but a single representation (that is, L,) of the nonzero elements 
of A. 


3.4 REPRESENTATION OF TREES* 


A tree T may be represented by a matrix and hence, in turn, by a vector 
in a number of useful ways as follows: 


1. by a full right list matrix JT or by any column permutation thereof 
(Sec. 1.23), 

by a full left list matrix [T or by any column permutation thereof, 
by a right list matrix «?/]T, 

by a left list matrix «?/[T, 

by various chain list matrices. 


wR WN 


The fullleft and right lists seldom prove more convenient than the more 
concise left and right lists. Except for the special case of a homogeneous 


* Johnson (1962) provides a comprehensive treatment of the representations of trees 
and discusses the suitability of each representation for a variety of search procedures. 
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tree, both the right list and the left list are awkward to use for path tracing. 
This function is better served by the chain list matrix, to be defined as a 
formalization of the chaining scheme suggested in Sec. 3.2. 


Simplified list matrices 


In certain important special cases, the various list representations of 
trees may be simplified. If the degree of each node is a known function 0 
of the value of the node, then for any list matrix M, M,’ = 0(M,’), and the 
degree vector M, may be eliminated with- 
out loss. The node vector alone then repre- 
sents the tree and may be referred to as a 
right or left list vector as the case may be. 

For example, in the tree of Fig. 3.8 
(which represents the compound logical 
statement * A (y V 2)), a fixed degree is 
associated with each of the logical operators 
and, or, and not (namely, 2, 2, and 1), and the 
degree zero is associated with each of the vari- 
ables. The statement can therefore be repre- 
Figure 3.8 The compound sented unambiguously by the left list vector 
logical statement ® \ (y Y <) 


v=(A, ,2%, V.¥, 2). 

This is the so-called Lukasiewicz, Polish, or parenthesis-free form of the 
compound statement [Lukasiewicz (1951) and Burks et al. (1954)). 
Frequently, the only significant nodes of a tree T are its Icaves (e.g., in 
Example 3.2 and in a certain key transformation of Fig. 4.7) and all other 
nodes may be considered as nulls. Hence if M is any list matrix, the 
significant portions of M, and M, are (M, -7.0)/M, and (M, = 0)/M,, 
respectively. These significant portions may then be coalesced to form the 


single vector 0), My) 
v= M,, (M, = ), vio 


which, together with the logical vector (M, = 0), forms a J/eaf list matrix 
that describes the tree. Moreover, if the values of the leaves are distin- 
guishable from the components of the degree vector, the logical vector 
(M, = 0) may also be dropped. 


The use of left lists 


The use of the right list matrix is illustrated by the repeated selection 
sort treated in Sec. 6.4. The use of left lists will be illustrated here by two 
examples, each of interest in its own right: the partitioning of the left list 
of an w-tuply rooted tree to yield the left lists of the component singular 
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1 l-origin indexing 
2 lz Given left list of T. 
3 i Row index of Z in ascending 

scan. 
4 r Indicated number of roots of 
5 current rooted subtree. 
6 m | Moment of current rooted 

subtree. 
7 D | Partition vector of Z, that is, 
8 D; = :4T)). 

—I 
9 
Legend 


—_ — 
—_- © 


Program 3.9 Partitioning of the left list of an n-tuply rooted tree 


subtrees and the construction of a Huffman minimum-redundancy prefix 
code. 


Example 3.1. Partitioning of an -tuply rooted tree. Program 3.9 shows a 
scheme for partitioning a left list Z of a tree T into component subtrees, i.e., for 
determining the vector p such that p; is the moment of the singular subtree T;. 
Thus »(p) = »(T), py; = (T,), and the infix (p 2 @ 7) | «.)//Z is the left list 
of T;. 

The loop 6—10 scans successive components of the degree vector Z, (in ascend- 
ing order) and computes r, the indicated number of roots, The value of r in- 
creases by, at most, one per iteration, and when r becomes unity, the end of a 
singly rooted tree has been reached. Its moment #7 is then appended (step 11) as 
a new final component of the partition vector p, the parameters #1 and r are reset, 
and the scan of the next rooted tree is begun. Normal termination occurs at 
step 3; termination at step 6 indicates ill formation of Z. 


Example 3.2. Huffman minimum redundancy prefix code. If b is any set such 
that (6) > 1, then any other finite set a4 can be encoded in 8, that is, represented 
by b. (The sets 4 and 6 may be called the ‘alphabet’ and ‘basic alphabet,”’ 
respectively.) Ifv(a) << »(b), the encoding may be described by a mapping vector 
R such that e(¢,) = b,. If v(a) .- v(b), then each a, must be represented by a 
vector x' < b. For example, ifa = °10)and 6 = 12), then the decimal digits a 
may be encoded in the so-called 8421 system: 


(2€(4)) | « =a, 
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(0) 
(1000) 
001) 
aol 


(1 10) 


aid) 


Figure 3.10 Construction of a Huffman prefix code 


In so-called fixed length coding the vectors x’ have a common dimension ¢/, and 
the decoding of a message m (consisting of the catenation of vectors x‘) involves 
the selection of successive infixes of dimension ¢. If the probability distribution 
of the characters a, occurring in messages is not uniform, more compact encoding 
may be achieved by using variable length codes and assigning the shorter codes to 
the more frequent characters. Decoding of a message in variable length coding 
can be performed only if the boundaries between the successive x, are indicated in 
some way. 

The boundaries between characters in a message in variable length code may 
be demarked by special partition symbols (which is inefficient) or by using a 
prefix code in which no legitimate code point x* is the prefix of any other legitimate 
code point, including itself. The index vectors of the leaves of any tree possess 
this property; conversely, any set of prefix codes can be arrayed as the leaves of 
some tree. Hence if each character of the set to be encoded is assigned as the leaf 
of acommon tree, and if each character is encoded by the associated index vector, 
a so-called prefix code is attained. Figure 3.10 furnishes an example of a binary 
code (i.c., the branching ratios do not exceed two) constructed in this manner. 
0-origin indexing is used. The discussion will be limited to binary trees. 

Iff; is the frequency of the ith character and J; is the length of the assigned code 
(i.e., the length of path to the root), then the most efficient code is attained by 
minimizing the scalar product f ¢ 1. This may be achieved by the following con- 
struction, shown to be optimal by Huffman (1952). First, the characters to be 
encoded are all considered as roots, and the two roots of lowest frequency are 
rooted to an auxiliary node (shown as a null element in Fig. 3.10), which is then 
assigned their combined frequency. The process is repeated until only two roots 
remain. The tree of Fig. 3.10 is optimal with respect to the frequencies shown to 
the left of the leaves. The appropriate combined frequencies are shown to the left 
of each of the nonleaves. 

Programs 3.1! and 3.12 show the construction of the tree T representing a 
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Program 3.11 Construction of the binary Huffman code T for characters ¢ 
with frequency f 


Huffman code for a set of characters c,; with frequencies f;, the former in terms 
of the tree itself and the latter in terms of its left list. 


Program 3.11. The frequency vector f is permuted (step 5) to bring it to ascend- 
ing order, and the tree is subjected (step 3) to the same permutation. Step 4 
replaces the first two rooted subtrees of T by the single subtree obtained by root- 
ing them in a null, and step 6 makes the corresponding alterations in the 
frequency vector. The tree is initialized (step 1) as a one-level tree whose roots 
are the given characters, and the process terminates when the number of roots of 
T has been reduced to two. 


Program 3.12. The tree T of Program 3.11 is represented by the left list node 
vector %, in conjunction with the implicit degree vector d = 2 x (2 =e). The 
algorithm differs from Program 3.11 primarily in the reordering of the subtrees 
(steps 6-9), Step 7 appends to x the left list of the ‘th subtree (of the reordered 
tree) selected by the partition vector p according to the conventions of Program 
3.9. Step 10 prefixes x by the new null root, and steps [1-12 redefine p 
appropriately. 


Program 1.2] can be applied to the left list produced by Program 3.12 to deter- 
mine the associated index matrix (in a 0-origin system), and hence the actual 
codes assigned. 

It is not essential that the characters be assigned to leaves in precisely the order 
specified by Programs 3.11 and 3.12, and it is sufficient that the dimension of the 
leaf index increase monotonically with decreasing frequency of the character, It 
is therefore unnecessary to carry the characters themselves through the process; 
it suffices to determine the structure of the tree, sort the corresponding index 
matrix to right list order (which is ordered on dimension of the index vectors), 
and assign the characters (in decreasing order by frequency) to successive leaves. 
Since the structure of such a tree (whose nodes have a common irrelevant value 
and whose nonleaves all have a common branching ratio equal to the number of 
roots) is sufficiently determined by the moment vector u(T), the process of Pro- 
gram 3.12 can be simplified. 


126 Representation of variables §3.4 


1 zZ<—c l-origin indexing 
T 
2 D < &(r(%)) c | Given character set. 
3-—a Wf): 2 Bamere z | Left list of Huffman 
tree. 
4 i<] : 
J; | Frequency of ith sub- 
5 x < €(0) tree of 2. 
6 i Of); Pp | Partition vector p; is 
; : : the moment of the ith 
7 XN G) (((p iv a1) 1 aPi)/z) subtree of 2%. 
8 i<it+l x* | Reordered left list 
<{o.. with subtrees in 
? ees) ascending order on 
10 z<—(°) Ox frequency. 
1 p ~ (lf Ip 
( ( j Legend 
12 p<-U+(p foe) © w/p ; 
nue Program 3.12 Construction of the 
13 SF — OF IF left list z of the binary Huffman 
14 f —(+/elf) © Rif code for characters ¢ with fre- 
quency f 


Chain list matrices 


The full chain list matrix of a tree T is a matrix P of dimension p(T) x 
(O(T) + 2) defined as follows: P, is some node vector of T, P, is the 
associated degree vector, Pi ,, is null if j exceeds the associated degree P,' 
and is otherwise the row index in P of the jth node emanating from node 
P,‘, Table 3.13 shows a full chain list matrix for the tree of Fig. 1.16. A 
full chain list matrix is called a full right (/eft) chain list matrix if the nodes 
occur in right (left) list order. 

The full chain list matrix is a formalization of the scheme suggested in 
the discussion of chained representations (Sec. 3.2). Its convenience in 
forward path tracing is obvious. Since it does not identify the roots of the 
tree, an auxiliary vector must be provided for this purpose. However, if 
the ordering chosen for the nodes is that of a right list, the roots occur 
first in the list, their number r = »(P,) — (+/P,) is specified by the degree 
vector P,, and the need for the auxiliary vector vanishes. Moreover, since 
a right list groups all nodes emanating from a given node, each row of 
a?/P is simply a sequence of integers followed by null elements, and the 
information necessary to path tracing is provided by the column P3 alone. 

The right chain list matrix of a tree T is therefore defined as a°/P, where 
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dyn Q d’ | n’| p din |) f | ah 
1 1 n | 18 ° ° ° 3 a 4 1 n 9 18 l 
2} 2 g | 16 | 26] ¢ ° 2 b 7 2 g ° | 16 2 
3 | 0 u 9 9 9 ° 2 g 9 0 u | 14] oe 3 
4/0 t ° ° ° © 4 ec | ll 0 t |} 19] © 4 
5 | 3 a | 24 9 1 2 0 Z ° 3 a 6 | 24 5 
6 | 2 b 8 | 20] ° ° 1 n | 15 2 b 2 8 6 
71,0 Vv % ° ° ° 0 k ° 0 Vv ° ° 7 
8; 0, k ° ° ° ° 3 h | 16 0 | k |} 20 > 8 
9; 0 Zz ° ° ° 9 0 j ° 0 Zz 1} ° 9 
10; 0} o 3 © 9 ° 0 1 ° 0} oo; 17] 2° | 10 
11 | 0 f ° © ° 9 0 f 9 0 f 1} is] o 7] ll 
12 | 0 r ° ° ° ° 0 d ° 0 r { 21 o | {2 
13 | 0 y ° 9 ° ° 0 r ° 0 y ° o | 13 
14 | 2 S 4] 19] © ° 0 e ° 2 s } 23 4] 14 
15 | 0}d 9 9 9 9 2 i | 19 0} d |} i2}] ° 7] 15 
16] 0 j ° e e ° 0 | o ° 0 j | 26 > | 16 
17} 3 |m 3} 14} 23 9 3 m | 21 3 | m 7 3) 17 
18 | 2 i | 22) 25 | © ° 0 v 9 2 i > | 22] 18 
19| O} wie 8 9 ° 0 P ° O}w 2 o | 19 
20; 3 | h | 10) 17 TI| eo 0} q © 3 h ° | 10} 20 
21 | 0 e 9 3 9 9 0 u ° 0 e 2 > | 21 
22} 0] p 9 ° 9 ¢ 2 s | 24 0 |] p | 25] 2° | 22 
23 | 1 x | 13] ¢ e ° 1 x | 26 | x > | 13] 23 
24) 4) ¢ J ll} 15] 12 | 21 0 t 9 4 c 9] Ll | 24 
25 | 0 q S ° ° S 0 Ww e 0 q 5 2 | 25 
26! 0 1 ° 9 9 e 0 y 9 0 l 2 2 | 26 
A full chain list matrix The right chain Filial-heir chain list 
list matrix 
(a) (6) (c) 


Table 3.13 Chain lists of the tree of Fig. 1.16 


P is the full right chain list matrix of T. It is illustrated by Table 3.134 
Program 3.14 shows its use in path tracing. Although the degree vector P, 
is redundant (that is, P; and P, can be determined one from the other), it 
provides a direct check (step 6) on the legitimacy of the index vector r 
which would be difficult to obtain from P, alone. 

For a search of the type described by Program 3.14, it is necessary to 
scan down a level until agreement is reached and then across to the next 
level. For this type of scan, the filia/-heir chain list is compact and con- 
venient. 
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1— «+1 l-origin indexing 
Z| 
2 Lae r | Given index vector. 
3 d<(P,) — +/Py P | Right chain list matrix of T. 
4—4 7: Wr) Ea P, | Degree vector of T. 
5 fepe P, | Node vector of T. 
PA \Pal mond P, | Chaining vector of T. 
p | Path vector T’. 
7 ee SS a d_ | Degree of current node. 
8 D; < P,' k | Base address of the infix con- 
9 d <P, taining the current node. 
io k = P,! i ae. succeeding node in the 
j Current index of index vector 
r. 


Legend 


Program 3.14 Determination of the path p = T’ from the right chain list 
matrix P 


The set of (j + 1)th level nodes of the subtree T, are collectively called 
the jth filial vector of node i, and the first member of the first filial vector of 
node / is called the heir of node i. (For brevity, the first filial vector of a 
node will also be called its filial vector.) If each node is chained only to its 
successor in the filial vector containing it and to its hetr, the resulting 
representation is called a filial-heir chain list. Formally, the filial-heir 
representation of a tree T is a matrix F of dimension u(T) x 4 such that 
F, is a node vector of T, F, is the associated degree vector, Fy 1s a filial 
chain such that F,' = 7 if node F,/ is the successor of node F,' in the 
smallest filial set containing it and F,’ = « ifnode F,' has no such successor, 
and F, is an heir chain such that F,’ = h if node F," is the heir of node F,' 
and F,! = o if F,’ is a leaf. The filial-heir chain list is illustrated in Table 
3.13¢. 
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EXERCISES 


The symbols @ and c will be used exclusively to denote lower case and capital 
alphabets defined as follows: 

a@ =(0,4a,b,¢,...,Z,06.5. 4, %, +). 

c =(0,A,B,C,...,Z,..5, 4, *, +). 
The expression m < » will be used to specify the set « as the range of the com- 
ponents of 7. 


3.1 For each of the following cases, specify a suitable encoding matrix and 
format vector and show the explicit value of the infix of m which (in a solid 
representation) represents the given example vector #: 
(a) the decimal digits @ = 1°(10) in a ranked fixed-length code form & v2). 
Example: * = (6, 8, 9). 
(b) the set @ in a ranked fixed-length code for m & 1(2). 
Example: * = (c, a, t). 
(c) the seta@U cy 910) ina fixed-length code for m € v0). 
Example: x = (M,a, y,°, 3,,, 1,9, 6,0, .). 
(d) the set aU ¢ in a two-case code (with single-character shift) form & a, 
(See Brooks and Iverson, 1962.) 
Example: » =(T,r,0,y,,,N,.,Y,.). 
(e) the set @ in a Huffman prefix code for m < (2). Assume the frequency 
distribution given in Dewey (1923). 
Example: « = (t, r,e, e). 
3.2 For each of the cases of Exercise 3.1 write a program which decodes the 
infix (/ | o/)/a, that is, which produces the vector 2 represented by the infix. The 
auxiliary physical vector! < s may be employed to represent the first column of 
the encoding matrix, where s is the set encoded. Perform a partial trace of each 
program for the example value used in Exercise 3.1. 
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3.3. The ordered set of months mt =(JANUARY, FEBRUARY...., 
DECEMBER) is to be represented by the physical vector m < ¢ Vy w(10). For 
each of the following types of representation, specify a particular representation 
and show the values of the relevant components of 7: 
(a) a linear representation (employing null elements for filling to a common 
dimension in 7). 
(b) a solid representation for cach element of 77 and an appropriate grid 
matrix itself represented linearly. 
(c) a chained representation. 
(d) a double chained representation. 
3.4 (a) For each of the cases of Exercise 3.3, write a program which selects 
month 17, 
(b) Trace each program for the case k = 2. 
(c) For case (d) of Exercise 3.3, write a program which selects 7, by 
forward chaining if & :° rm) +2, and by backward chaining if 
ko. wm) + 2. 


3.5 For cach of the cases of Exercise 3.3, write a program which “prints out” 
the set of months in a minimum number of #-character lines, inserting a single 
null between successive months except where (i) further nulls must be added to 
prevent the continuation ofa single word from one line to the next, or (11) no null 
is needed between two successive words, the first of which is coterminous with 
the line. In other words, produce a matrix Z of row dimension # and of minimum 
column dimension such that (2 - »E)/Z =(e(mt,)°- etm) — +--+ | e(77yy), 
and such that each row Z may be partitioned into one or more vectors of the 
forme(m,) - ce, all but the last of which must be of dimension +fe(17z,.)] + 1. 


3.6 Assuming a linear representation for cach of the logical vectors involved, 

and a forward-chained representation for cach of the remaining operands, write 

programs for the following operations. Assume in each case that the arguments 
and ¥ need not be retained, and assume the use of a backward-chained pool 
where necessary. 

(a) F< KX, Uy 

(b) z< /x, u,y/ 

(c) 2< ATs 

(d) z< Alex 

3.7 Repeat Exercise 3.6(a), using separate grid matrices for x, ¥, and z instead 

of chained representations. Specify a suitable linear representation for each of 

the grid matrices. 

3.8 (a) Ifa chained representation is used for a vector x, then the selection of a 
specified component can be made faster by providing a number of 
alternative starting points for the required scan. State precisely the 
quantities required in such a process and write a program showing its 
use. 

(b) If provision is made for starting the scan at any component of x, the 
chained representation may itself be simplified. Show precisely what 
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the simplified form is and identify the type of representation to which it 
is equivalent. 
3.9 Frequently a vector « kept in a partitioned representation (for efficient use 
of storage) must be “unpacked” to a linear or other more accessible form for 
efficient processing. The converse operation of “packing” is also required. Let 
the partitioned representation be a file & employing an intercomponent partition 
A,, and a terminal partition Ay, and write both packing and unpacking programs 
for each of the following cases. Assume that the maximum dimension in z of any 
component is a. 
(a) A solid linear representation employing null fill. 
(b) An allocation prescribed by a grid matrix G with G, = ne. 


3.10 Let a < 1"(2), let the set @ be encoded in a five-bit code such that (2e) 
e(a@;) = 7, and let each component of the vector x be an (uncapitalized) English 
word. Using O-origin indexing throughout, specify a suitable partitioned repre- 
sentation in x for the vector x, and repeat Exercises 3.9(a) and 3.9(b), using it in 
licu of the files. 


3.11 For cach of the following pool organizations, write a program to convert 
a given marked pool into a backward-chained pool: 

(a) dimension-ordered. 

(b) address-ordered. 


3.12 For each of the following queue disciplines, write programs which take 
from and return to the pool an infix of length #. Use secondary linking and 
‘relegate to a marked pool any infix which is too short for linking. In cach case 
choose the type of chaining best suited to the particular queue discipline. 

(a) LIFO (last-in-first-out). 

(b) FIFO (first-in-first-out). 

(e) Dimension ordered. 

(d) Address-ordered (utilize the possibility of fusing adjacent infixcs). 
3.13 Give a complete specification of a scheme for representing a tree T by a 
full chain list matrix which ts not in right list order. Write a program (expressed 
in terms of the physical vector 7) which determines the path vector Ti fora given 
index vector i. 


3.14 Give a complete specification of a scheme allowing joint representation of 
those components shared by two or more of a family of vectors x's was 
suggested in Sec. 3.2. Write programs to (i) select component x,', and (11) delete 
component *,'. 


3.15 Let mc ay 10), and let x1, x7,..., x" be a family of vectors whose 
components belong to the set &/[a U W(10)]. Let the average and the maximum 
dimensions of the vectors x’ be a and #, respectively. Assume that the chaining 
index is represented in decimal, with cach digit represented by one component 
ofa. Determine (as a function of #7 and #) the value of a below which a chained 
representation provides more compact storage than a linear representation with 
null fill. 
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3.16 Write a program which uses the minimization operation u~ [x to 
determine the ordering permutation vector p < 0,/(@ 4, 8). 
3.17 Let U =(X 0) andr = U/X jointly represent the sparse matrix X. 
(a) Write a program which determines (as a function of U andr) a suitable 
row-chained and column-chained representation of X. 
(b) Write a program defined on the representation produced in part (a) to 
compute the product Y = X | X, itself represented in the form V = 
(¥ » O)and p = V/Y. - 
(c) Write a program to determine the trace (that is, +/I/X) of X from the 
representation produced in part (a). 
3.18 The unique assignment of Huffman codes produced by Program 3.f2 ts, 
in general, only one of many equally efficient assignments, since the symbols to be 
coded need only be assigned, in decreasing order on frequency, to the leaves of 
the code tree in increasing order on their levels. Show that the structure of the 
tree produced can be sufficiently described by its moment vector alone, and write 
a program for the construction of a Huffman code based on this fact. 


3.19 Following the notation and terminology used in Program 3.9 for the 
analogous case of a left list write a program which determines from the right 
list R of a tree T, the partition vector p which partitions it by levels. 
3.20 Write a program which determines the right list R = a*/]T as a function 
of the left list £ = a°/[T. Incorporate tests of well formation. 
3.21 Let [Xe 1) denote the pth power of the square matrix X with respect to the 
Operators ©, and Gy, that is, [X<71]" = X35 X -1--- 1X to p factors. 
(a) Show that ([C ; }"),/ = 1 if and only if there is a path of length p from 
node / to node ; in the graph (n, C). 
(b) Show that [C Q }’ = 0 for some p - »(C) if and only if (1, C) contains no 
circuits. 
(c) If (2, C) contains no circuits, the connection matrix C is said to be “‘con- 
sistent.”’ The result of part (a) can be used to check consistency. Program 
the alternative method of Marimont (1959). 
(d) FH =C \ I, then ([H : ]")? = Lifand only if / = jor there exists a path 
from node i to node j of length a « p + |. Show that for any connection 
matrix C, [H : ]’ converges to a limit. 


3.22 Devise programs to determine 
(a) whether a given connection matrix C represents a trec. 
(b) the left list of the tree (71, C). 
(c) the right list of the tree (n, C). 
(d) a node list m and connection matrix C as a function of 
(i) a left list L 
(ii) a right list R. 
3.23. Show that (nm, C) and (ny, Cy?) represent the same graph for any permuta- 
tion p. 
3.24 If(n, C)isatreeandifK =C ,; C, then Ccan be determined as a function 
of K (see Ross and Harary, 1960). Write a program for determining C from K, 
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SEARCH TECHNIQUES 


In classical applied mathematics most functions of interest can be 
approximated by some algorithm which becomes, for practical purposes, 
the definition of the function. In other areas, however, many functions of 
practical, if not general, interest (such as the correspondence between 
employee name and salary) can be specified only by an exhaustive listing of 
each argument value and its corresponding function value. Such a function 
will be called fortuitous, 

The basic algorithm applicable to the evaluation of a fortuitous function 
is a search of the list of arguments, i.¢., a comparison of the given argu- 
ment with the list of arguments to determine the correspondent to be 
selected. In such an algorithm it is convenient (as illustrated by Program 
1.12a) to distinguish three phases which successively determine the 
following quantities: 


(1) the index or rank ¢ = kek of the argument k in Rk. 
(2) the index / = p, of the correspondent in s. 
(3) the correspondent s = s;. 


Step (2) is a permutation defined by the permutation vector p. Steps (2) 
and (3) are simple selections from structured operands (normally in linear 
representations) and require no further discussion. Step | is called 
ranking, and the methods for accomplishing it merit detailed treatment, 

The argument & of a mapping (and hence ofa ranking) operation will be 
called a key. The German-English dictionary mapping of Example 1.2 ts 
typical of mappings from key to correspondent. Ranking is itselfa special 
mapping from the key set & onto its own index set uU((R)), 

If the representation used for some or all of the data imposes certain 
restrictions (such as serial access), there may be some advantage in 
coalescing the three phases of the mapping operation so as not to determine 
the rank explicitly. It will, however, be convenient to limit the discussion 
almost exclusively to the problem of ranking. 

There are two main types of ranking processes: scanning and key 
transformations. A scanning process compares the key & successively with 

133 


134 Search techniques 


selected elements of the key set k to determine the rank r= kek. A key 
transformation is any function or algorithm ¢(k) which maps the set k into 
some subset of the integers. The set of derived kevs is defined as the set 
d containing all derived keys arranged in ascending order. The set of all 
keys which map into d, is called the jth equivalence class defined by the 
transformation f. 

If »(d) = +(k), the key transformation is biunique and the ranking 
operation may therefore be completed by a permutation p such that 
Pp, = ifor 7 = «k;). Ifv(d) < »(k), then at least two distinct elements of k 
map into the same element of d and the ranking process must be completed 
by a scan of one of the equivalence classes defined by 7. 

If, for example, k = (m, tu, w, th, f) is the ordered set of working days 
encoded according to the encoding matrix 


| m 0 0 1 
2 tu 0 1 0 
3 C=Ttw 01 1 
4 th 1 0 0 
5 fF — J od 


and format vector f = (Q, 1, 1, 1), then scanning can be accomplished by 
comparing e(k), the encoded representation of the key A, with successive 
rows of f/C to determine the rank r of the row on which agreement occurs. 
Moreover. the key transformation 


t(k) = (2€) p(k) 


is unique, but requires an associated mapping vector m = (1, 2, 3,4, ~, 
, 5). Had pe(f) (that is, f/C°) been chosen as (1, 0. 1), the mapping vector 
would not have been required. Finally, the key transformation 


(A) = 1 + p3(k) 


has the range d = (1, 2), is not unique, and requires a subsequent scan of 
one or other of the equivalence classes e' = (tu, th), and e® = (m, w. f), 
represented by (010, 100) and (001, 011, 111), respectively. 

Although a strict ranking operation maps element k; into the integer /, 
any biunique mapping onto the index set u!(»(k)) will frequently serve as 
well. For, if pis the permutation required to complete the ranking process, 
and if a subsequent permutation j is required (as in step 4 of Program 
1.12a), the two permutations can be combined in the single permutation 
q = jp. Similarly, the ranking ofa set k may be considered as equivalent 
to the ranking of any set obtained by permuting R. 
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4.1 SCANNING METHODS FOR RANKING 


The two main methods of sean are called directed and controlled. A 
directed scan is begun in one of two possible directions from a given initial 
point #, either ascending (that is, i, i + 1. i+ 2,...) or descending. The 
direction chosen may be determined by a comparison between the given 
argument and the set element k; at the initial point /, A controlled scan is 
executed in a sequence which is determined by successive comparisons 
between the argument and each element scanned. In an effective controlled 
scan, cach comparison must determine the choice of the next element for 
comparison so us to (approximately) minimize the expected number of 
elements scanned. The directed scan is clearly well suited to the use of a 
file or chained representation, which imposes serial access to the elements, 
whereas the controlled scan Is not. 

The scan length (i.e., the number of key elements scanned) will be used as 
a measure in analyzing and evaluating scanning methods. The same 
measure and the same analysis apply also to the converse situation, where 
the rank of an element is given and the element itself must be obtained 
from a chained or other representation which permits only serial access. 
An alternative related measure is the normalized scan length or scan 
fraction, defined as the scan length divided by the number of elements in 
the set of keys. 

A scan is said to be rooted if each execution begins at the same point r. 
A rooted scan may be advantageous when the frequency distribution of the 
arguments is nonuniform and the most frequent keys can be grouped near 
the root. A sean is called catenated if each execution is begun at the end 
point of the preceding scan. A catenated scan may be employed tn using 
a file when the intervals between successive scans are so short as to allow 
little or no time for return to a fixed root or when the arguments are 
arranged in the same relative order as the items in the file. 


Directed scan 


A directed scan is called cvelic if element R, follows R, in an ascending 
scan and if k, follows k, ina descending scan, It is called nonerelic if the 
direction of scan is reversed whenever either of the terminal clements R, or 
k, isencountered. A cyclic scan Is appropriate to a chained representation 
with end-around chaining: a noncyclic scan is appropriate to a file or to a 
chained representation without end-around chaining. 

The initial direction of scan may be chosen in several ways, the more 
important of which are enumerated and discussed below. For independ- 
ently and uniformly distributed arguments, the expected scan fractions 


| Zz 
é 


CWO aenDAneEWne 
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Position Direction 

Any Fixed, continue or reverse 

Any To argument 

1 Fixed 

Catenated Fixed 
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Catenated To argument 
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Catenated Continue previous 

Catenated Reverse previous 
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are summarized in Table 4.1. Certain of the results are also plotted in 
Fig. 4.2. 


Initial direction fixed (fixed scan), The ascending direction willbeassumed, 
If the scan is rooted and cyclic, the root may, without loss of generality, be 
assumed to be one. The expected sean length for a set is then given by 


v(R) 


c= Zoe) xisefid 


where /(R;) is the normalized expected frequency of occurrence of the 
argument &; and f ts the corresponding frequency vector defined by 
fi = fbi). 

The most efficient fixed rooted scan is therefore obtained by using the 
permuted set a = (4/(—f))k such that the components of @ are arranged 


1.0 


pr (f =b) —> 


° 
or 


! | { 
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Scan length 6 -——~> 


0 | 


Figure 4.2 Plot of cumulative probabilities of Table 4.1 (Numbers refer to the 
entries in Table 4.1) 
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in decreasing order on frequency. If the distribution of arguments is 
uniform (that is, f; = 1/»(f)), then the expected scan length is (x(k) + 1) + 
2, and the expected scan fraction is approximately one half. 

If a fixed scan is cyclic and catenated, the expected scan length depends 
on the distribution f, but if the arguments are independently distributed, 
then the expected sean length is independent of the ordering of the scanned 
set k. This may be shown as follows. The expected length J, of a scan 
rooted at r is given by M* % f, where M is the square matrix such that 
M’ = (r — 1) lu. The probability of beginning a catenated scan at r is 
the probability of ending the previous scan at r, that is, f,. Consequently, 


e=fxrl=fiMif. 


Since, in general, fi Mo f=ftM=f, then 2e=(f.N-f) 
where N= M + M. If, for example, o(k) = 4, then 


12 3 4 2 6 6 6 

4 1 2 3 6 2 6 6 
M= ‘ and N= 

3.4 1 2 6 6 2 6 

2 4 1 6 6 6 2 


It is easily shown that N is of the form 
N = 2E + 0(k)h 


and consequently NP = N for any permutation p. But f. NJ f= 
(fp) (NE) (fp) in general, and since NP=N, then fy N. f= 
(f,) . N . (f,). Hence the expected scan e = (3) x f . N . f remains 
the same for any permutation of f or, equivalently, for any permutation of 
the key set k. 

If the arguments are not independently distributed, the analysis is, in 
the general case, very complex. However, a simple but effective use of 
correlation is made in the method of bafching. If a isa collection or batch 
of uncorrelated arguments, each of which is to be ranked in the set k, then 
the total expected scan time for a fixed catenated scan will be »(a) times the 
expected scan time for a single item. If, however, the set a is ordered on k 
(that is, @ = k © a), then the entire set a may be ranked in a fixed cate- 
nated scan whose normalized length does not exceed one.* If a given 


* The length of the scan will be determined by the maximum rank (in k) occurring in 
the sct a. The expected value of the scan fraction is approximately equal to the expected 
value of the maximum occurring in a sample of size # = 7(a) chosen from the continuous 
interval from zero to one. This value is known for various distributions [e.g., Cramér 
(1951) p. 370]; for a uniform distribution it ism + (# + 1). 
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argument set 2 is not ordered on k, it may first be permuted by some sorting 
process to yield the set a = pJz which is ordered on k. The set a may then 
be ranked in & and, if required, the set of ranks may then be subjected to 
the inverse permutation g = p1,u' to yield the ranks in the original set 2. 
The decrease in the total expected scan length may far outweigh the effect 
of the additional permutations required. 

If a fixed scan is noncyclic and rooted at one, the expected scan length is 
the same as for the fixed cyclic rooted scan. The fixed noncyclic scan is 
generally unsuited to any initial points r other than one (e.g., to a catenated 
scan), since the first (r — 1) elements of k are then reached only after a 
reversal of direction and a rescan of the set &’/k. 

For the case of a uniform distribution, the behavior of the fixed scan is 
summarized in entries 1, 3, and 4 of Table 4.1. The derivation will be 
illustrated in discussing entry 7. 


Initial direction giving shortest scan. For a fixed root r, the minimum 
expected scan is achieved if the items are disposed on either side of the root 
so that the frequency f,; is a monotone decreasing function of |/ — r|, the 
scan length in a direct scan to the argument. In acyclic scan, the position 
of the root is immaterial; in a noncyclic scan it is best centered at the floor 
(or ceiling) of ((k) + 1) +2. In a O-origin system this expression 
becomes 7(k) + 2. 

For an arbitrary frequency function, the expected scan length is given by 
the scalar product f | J, where / = |t!) — re}. For a uniform distribution, 
the results are given in entries 2 and 5 of Table 4.1. Fora catenated scan, 
the corresponding results appear in entries 2 and 6. 

The possibility of choosing the initial direction so as to give the shortest 
scan to the argument depends on the information available. If the elements 
of k are strictly ranked on some function g(k,), then the shortest direction 
from root r to argument x can, in the noncyclic case, be determined by a 
comparison of g(x) and g(k,). For the cyclic case this does not suffice, and 
it is necessary* to know the index in & of the argument w. This case ts 
therefore of interest primarily in selecting a specified element from a 
serial-access representation and is of little interest in an actual ranking 
operation. However, any double-chained representation or reversible file 
can be used in a noncyclic as well as a cyclic manner and hence admits of a 
choice of direction which is best in the noncyclic sense. 


Initial direction to nearer (farther) terminal. Vf the value of the root r is 
known for each individual scan in a catenated scan, the direction to the 
nearer terminal can be determined by comparing r with the midpoint 


* Approximating functions may, however, be used for estimating the index and the 
probable best direction. 
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((k) + 1) + 2. A noncyclic scan starting toward the nearer end is clearly 
less efficient than one starting toward the argument, but it may be useful 
when the most direct route to the argument cannot be determined. The 
expected scan fraction is shown in entry 7 of Table 4.1; its analysis will 
illustrate the method used in constructing the entire table. 

It is assumed that the number of elements »(k) is sufficiently large that 
the scan fraction f may be considered as a continuous variable. Let 
pr (f < 4) be the probability that scan fraction f does not exceed 4, and let 
the function be represented in three partssuch that pr ( f < 6) = pr, (f ~~ 4) 
in the ith half-unit interval in 6. Let 2 be the normalized initial position of 
a given scan. Then 0 <x <I, and, since the scan always begins toward 
the nearer terminal, the fraction of the set covered in a scan of length #4 is 
the same for the starting point (1 — x) as for x. Using this symmetry, 
attentioncan be restricted to valuesof.x in the range 0 to 3. For the function 
pri (f < 5), the value of 4 is also restricted to the range 0 to 3. 

Consider fixed values of band x witht < 3. If0 <x < 5/2, the fraction 
of the file covered by a scan of length 6 is given by 6 — 2, for the scan 
begins at x, proceeds a distance x to the nearer terminal, and returns to 
the point b — x. If 6/2 <« < 4, the fraction covered is clearly x, for the 
scan will reach the nearer terminal but will not return past «. Ifh <«# =< 4, 
the scan does not reach the nearer terminal, and the fraction scanned is 
therefore 4. Since x is uniformly distributed, the function pr, (f << 4) is 
obtained by integration as follows: 


b ly — hf? 
[ edx+t | b ar| = +b. 
vb 


“'b/2 2 


u/2 
pif <b) = al | (b — x) dx + 
0 


The factor of two arises from the symmetry in 2 and the restriction of x 
to the interval O < 2 < 4. Similarly, 
; v/2 te b? I 
pra(f <b) = 2| (b— x)dx + rde] =P 4d, 
0 PQ 2 4 
and 
b-1 ly 5 
prn(f <b) = 2| | dx +| (b — x) ae| = —b 4+ 3b-— 1 
0 b-J 


Entry 8 shows the behavior of the scan starting toward the farther 
terminal. Although the distribution differs markedly from that obtained 
for starting toward the nearer terminal, it has the same expected value of §. 
As may be expected, the function obtained for a fixed scan (entry 4) is 
the average of the functions obtained for cases 7 and 8 and is linear in 6. 
Case 7 (toward nearer terminal) yields the smallest maximum scan length 
of the three. 
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Initial direction reversed (continued) from previous scan. (n the absence of 
any other basis of choice, the initial direction can be chosen as a reversal 
or continuation of the direction which terminated the preceding scan. The 
behavior is shown in entries 9 and 10 of Table 4.1. The relations between 
the functions for fixed scan (F), continuation (C), and reversal (R) are 


Fo-R=-(F-—C)<0 for b <1, 
and F-—- R=—-(F—-—C)>0 for bv 1. 


Controlled scan 


The sequence followed in a controlled scan is commonly determined by 
a comparison which determines the relative ranking of any pair of elements 
x and yin the set k. It will therefore be assumed that comparison of the 
argument « = k, with the element k, determines whether A << j, 4 = /, or 


— j<1 
k < r(k) 
J-l@+h+2] 

x ik; L—> 
> 

— k~j-1 
ijt 


Program 4.3. Ranking of « in k& by binary search 


ii j. The subsequent scan may then be limited to one or other of the 
two subsets a’ '/Rand a//k. The maximum dimension of the subset remain- 
ing to be scanned is therefore minimized by choosing / = |[@(k) + 1) + 2]. 
If each subsequent element for comparison is chosen so as to (approxi- 
mately) halve the dimension of the set remaining to be scanned, the 
process is called binary search. Program 4.3 shows the details of binary 
search; / and k are the indices of the terminal elements of the remaining 
subset, and / is the index of the element selected for comparison. 

If x(k) = 2", then any one of 2/"' different arguments may be isolated on 
the jth comparison, for / eu'(k), and the one remaining argument will be 
located on the (A + 1)th comparison. Hence for a uniform distribution of 
arguments, the expected number of comparisons required in a binary 
search is given by 


e(2") = (129+ 2-2!4 3-2 4-5-4 -2" 14 (hk +:(1)) = 2. 
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It can be shown (e.g., by induction on &) that 
e,(2") = [(k — 12" + k 4+ 2] + 2". 


The expected number of comparisons is therefore approximately (k — 1), 
and for a general value of »(k), the number is approximately [log, »(k)| — 1. 
The expected value thus differs but slightly from the maximum value 
[logy (o(k) + 1]. 

If e,(v(k)) is the expected number of comparisons required in a fixed 
scan of k and if r is the ratio of the execution time for one step of binary 
search to the execution time for one step of fixed scan, binary search is 
(for a uniform distribution) the more or the less efficient according as e, 
exceeds or is exceeded by re, Although the simplicity of Program 4.3 
suggests that the ratio r is small, it will be large if the representation of the 
elements of k permits serial access only. 

The methods may be combined by using & steps of binary search to 
select one of 2" subsets, which is then subjected to a fixed scan. If the 
remaining subset contains m elements, the (approximate) reduction in the 
expected number of comparisons achieved by one further step of binary 
search is e,(m) — e,(m/2), and binary search should therefore be discon- 
tinued when e,(m) — e,(m/2) <r. For a uniform distribution, this result 
yields the following approximate expression for the optimum number of 
steps of binary search: 


- 1) 
s los: ( rk 

The ranking type of comparison required in determining the sequence 
in a controlled scan is always attainable for any arbitrary set & or for 
some permutation thereof. For, if e(R;) is the representation of R; in 7, 
if wm <u), if t; = (be) | p(k,), and if a = (6/t){k, then the relative 
ranking of any pair of elements of a can be determined by comparing the 
base 4 values of their representations. If, for example, b = 10, and the 
four successive elements of k are represented by (1, 0, 9), (0, 6, 4), (7, 1, 3) 
and (5,0, 6), then a@ is represented by (0, 6, 4), (1, 0,9), (5,0, 6), and 
(7, 1, 3), and relative ranking in a@ is determined by comparing elements as 
decimal numbers. 

In the execution of the binary search, the calculation of the index / (next 
element for comparison), and the explicit determination of the terminal 
indices /and k can be avoided by associating with each element k, a pair of 
indices which indicate the two possible succeeding choices for j. More 
precisely, if M is a matrix of dimension »(k) x 3, whose first column is the 
set k, and whose second and third columns are vectors of indices (or nulls) 
from the set U(a(k)) U (¢), then Program 4.4 describes a directed scan of k. 
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— JF Hl) + 2 = 2] 


vw: M,! oma 
= 
— jo™M, 
j —™M;' 


Program 4.4 Generalized binary search 


The elements of M, and M, can be so chosen as to execute a binary search 
equivalent to that of Program 4.3. This is true, for example, for the 
matrix M of Fig. 4.5. 

The ordering of the elements of M, is clearly immaterial, i.c., if M, were 
permuted, then columns M, and M, could be respecified so as to yield the 
original scanning order. One consequence of this is the fact that the rows 
can be reordered so that the scan conveniently begins with the first row 
rather than with row [(u(M) + 1) + 2]. A more important consequence 
is the possibility of applying the method to the problem of multiple keys, 
which will be raised in the treatment of key transformations. 

As illustrated by Fig. 4.5, the matrix M specifies a tree whose nodes are 
the elements of M,, whose branching ratios are two, and whose paths are 
traced by Program 4.4. The columns M, and M, can clearly be chosen to 
specify a scan sequence other than that of binary search. In particular, the 
element M,’ selected for comparison may be chosen so as to equalize (as 
far as possible) the total probability of the arguments in the two resulting 


1 bee ° © 
2 cab 1 3 
3 cat o 4 
4 dog © 0 


5 M=|egg 2 7 


6 foe ° ° 
7 had 6 8 
8 hoe ° 9 


9 nod 9° © 


Figure 4.5 Tree traced by Program 4.4 
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subsets rather than to equalize the number of elements. This procedure 
yields the most efficient controlled scan. If the probability distribution is 
uniform, the method reduces to binary search. If the arguments are 
drawn from two or more sets having distinct probability distributions, the 
matrix M may be enlarged to include two index columns for each distinct 
set. Each such pair of columns may then be designed to provide an 
optimum scan for the associated distribution. 


4.2 KEY TRANSFORMATIONS 


Since a key transformation maps the set of keys R into a set of integers 
(the set of derived keys d), any unique key transformation produces a 
derived key which can be used to select the component of a mapping vector 
directly and thus complete a ranking operation without the use of a 
scan, If the transformation is not unique, it may still be used to partition 
the original set & into +(d) subsets for scanning and so reduce the expected 
scan time. Ideally a key transformation should be both simple and unique 
and should produce a derived set d having a narrow spread; in practice, 
compromises must be made. 

Let & be the domain and d the range (in ascending order) of a key 
transformation /(k;) and let e? be the equivalence class in & which maps 
into d,, that is, (v) = d, for all wee’. The coalescence of t in k is then 
defined as the vector ¢ such that c, = »(e’), for j/eU((d)). Since the 
equivalence classes are disjoint and collectively exhaust Rk, then +/c = 
v(k). The spread of t in k is defined as | + d, — d,. Thus if R ts the set 
(Sunday, Monday,..., Saturday), and if ¢ maps each day into the rank 
(in the alphabet) of its leading letter, then d = (6, 13, 19, 20, 23), the 
spread s = 18, c = (1, 1, 2, 2, 1), and +/¢ = o(k) = 7. 

The key transformation is biunique if and only if ¢ = €. Moreover, if 
the transformation / is biunique, the ranking operation (1.e., the determina- 
tion of the index of the argument in &) can be completed by a mapping 
vector whose components are selected by the index / = #(k,) — d, + I, 
and whose dimension is equal to the spread of tin k. The key transforma- 
tion of the preceding example is biunique when restricted to the set » = 
(Sunday, Monday, Tuesday), the set of derived keys is (13, 19, 20), and 
the mapping vector m = (2,°,°,0,¢,°, 1,3) of dimension eight serves 
to complete the mapping if its components are selected by the index 
j= (x) — 12. 

A key transformation is called j-origin if d; = j. Since the origin can be 
changed by subtraction of a constant, attention will be restricted to 
l-origin transformations. The spread of a I-origin transformation is 
clearly d,,. 


§4.2 


A biunique key transformation is always 
attainable since, as remarked in the treat- 
ment of directed scan, the base 4 value of 
the representation of the elements of the 
domain Rk can be used. The spread of such 
a transformation may, however, be im- 
practicably large. If, for example, * were 
some small subset of the set of all ten- 
Jetter sequences, (¢.g.. all meaningful ten- 
letter words), then s would be 26!) and 
the required dimension of the associated 
mapping vector would be impracticably 
large. 

In general, if each element of & is of 
dimension / in m and if the (used) range 
of each element of 7 is the set U4), then the 
mapping vector required is of dimension 
b', The use of the base > value of the rep- 
resentation in selecting the component of 
the mapping vector is equivalent to select- 
ing a path through a uniform b-way tree as 
illustrated (using 0-origin indexing) in Fig. 
4.6, for b = 3 and 4 = 3. The branch to 
the /th level ts selected according to the /th 
component of the representation. 

Sequential level-by-level selection in the 
tree is less convenient than the direct use 
of the base ) value, except that the former 
frequently allows the full tree to be greatly 
contracted. If, for example, the tree of Fig. 
4.6 is used for the set k = (200, 010, 120, 
001, 022, 202) (as indicated by the numeric 
leaves whose values are the O-origin ranks 
in k), then the full tree can be contracted 
to the nonhomogeneous tree of Fig. 4.7. 
The contraction is defined formally as 
follows: if the subtree rooted in a given 
node contains no significant leaves, the 
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Figure 4.6 Uniform tree and 
mapping vector for the setk = 
(200, 010, 120, 001, 022, 202) 


subtree is replaced by a single null leaf; if the subtree contains exactly 
one significant leaf, the subtree is replaced by that leaf The contracted 
tree can then be represented by a chain list matrix or, since all nodes save 
the leaves are null, by a leaf list matrix M. For the example of Fig. 4.6, 
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0 3 0 
1 24 
2 6 0 
3 3 1 
4 14 
5 41 
6 M=To o 
7 . 0 
8 2 0 
9 01 
10 o 0 
ul 51 


Figure 4.7. Contracted tree and associated leaf list matrix M 
for the set k = (200, O10, 120, 001, 022, 202) 


M is given in Fig. 4.7. The sequential biunique key transformation on 
the leaf list matrix of the contracted tree is described by Program 4.8. 


Program 4.8. The components of the argument x are scanned by the index /, 
and step 5 determines / as the index of the current node in the path. If node / is 
not a leaf, then step 6 determines & as the index of the first node reachable from 
node /. If node /is a leaf, then & is specified as the value of the leaf, and the right- 
pointing exit is followed on step 8 unless the exit at step 7 is taken first. This exit 
occurs only if « is an illegitimate argument which leads to one of the null leaves 
(such as the last leaf of Fig. 4.7) remaining in the contracted tree. The contraction 
was performed in the specified manner so as to allow the incorporation of such a 
test. If it is not required, the tree can be further contracted by eliminating the 
null leaves. The left-pointing exit on step 4 also indicates an illegitimate argu- 
ment x, but one of insufficient dimension for the particular path specified. 


The biunique key transformation provided by Program 4.8 is very 
effective when applied to a set whose dimension is small compared to the 
spread of the transformation produced by taking the base 6 value of the 
representation as, for example, in a glossary of English words.* A dis- 
advantage of the process is the need to revise the entire leaf list matrix 
when additions to or changes in the argument set occur. The process can 


* See, for example, Lamb and Jacobsen (1961). 
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{ = «0 0-origin indexing 

@ a x Argument. 

Sf eg PL i Current node. 

4) 4 jo: vx) J Current index of argument. 

5 ick +4; k Index of first node reachable from 
- node / and finally the rank of x. 

° Kia Me M_ | Leaf list matrix. 

Ts are Rack M, | Mj! = 1<>hisa leaf. 

§ _—=] M,': 0 te M, | Combined leaf and chaining vector. 


Legend 


Program 4.8  Biunique transformation on key represented by x, using the leaf 
list of the contracted tree of key set k. 


be modified to produce a simpler but nonunique transformation by 
contracting the tree further so that some or all of the remaining leaves each 
represent two or more significant leaves of the original tree. 


Nonunique key transformations 

Although it is frequently impossible to obtain a sufficiently simple 
biunique transformation having a sufficiently small spread, it is always 
possible to produce a simple key transformation of arbitrarily small spread 
if the requirement of uniqueness is dropped. For example, the spread of 
the key transformation 


ihe) p(x) + d] 


varies inversely with d, but the transformation is usually nonunique for 
div 1. 

Ifa key transformation is not unique, the ranking must be completed by 
a scan of one of the equivalence classes which it defines. The scan of each 
of the equivalence classes e’ may, in general, be either directed or con- 
trolled, and the individual subsets may be ordered by frequency of occur- 
rence, by the base 4 value of their representations, or by some externally 
imposed (e¢.g., chronological) order. Ifa chained representation or file is 
used for each subset, a directed scan is normally used. 

The expected length of a directed scan of each of the equivalence classes 
e/ may be computed and weighted by the relative frequency of the class to 
yield an expected over-all scan length. If the distribution of arguments is 
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uniform, the expected scan length for e’ is given by 
We)+ DF25 (4+) +2, 


where c is the coalescence of the transformation. Moreover, the relative 
frequency of arguments from e/ is c; + (+/c). Consequently, the over-all 
expected scan length / is given by 


pa let 95], ke) Wo 
2(+/c) CKE 


For a fixed dimension of the derived set d (and hence of c), and for a 
necessarily fixed value of +/c = »(k), the value of / is clearly minimized 
if the coalescence vector c is uniform, 1.e., if the components of ¢ are all 
equal. Hence the expected scan length is minimized by a key transforma- 
tion whose equivalence classes are of equal dimension. 

A given key transformation is frequently employed to rank a variety of 
subsets of its domain & rather than k itself. For example, if & is the set of 
English words in a given dictionary, then one of the subsets to be ranked 
may be the set of distinct words in a particular sample of English text. If 
a particular subset of k is specified, then the coalescence of the key trans- 
formation in the specified subset * can be determined, and the transforma- 
tion can be chosen accordingly. More generally (as in the case of samples 
of English text), the active domain * may be only partially specified. The 
transformation should then be chosen so that its coalescence is nearly 
uniform for the expected active domains. If, for example, & is the set of 
all five-letter sequences, and if each active domain is the set of five-letter 
sequences beginning with a specified letter, then the key transformation 
used should depend only on the last four letters. If the set of derived keys 
produced by a key transformation has a spread s and a random uniform 
distribution within that spread, then the expected length of a scan (of the 
equivalence classes) can be shown* to be | + »(k) + 2s. 


Scanning of the equivalence classes. If an element x is to be ranked in & by 
a scan of F itself, no auxiliary information is required since the rank of 
component k, is simply its index 7. If some permutation of & is used 
instead, then an auxiliary ranking vector r (i.e., a permutation vector) 
must provide the rank in the given set k. Specifically, if y = p|k is used 
for ranking, then r is the permutation vector inverse to p, and the rank of 
element y, is r,. Finally, if the vector y is itself to be scanned in some 
prescribed order other than simple cyclic order, the order may be repre- 
sented by a chaining vector q. 

The vectors y and r or y, r, and g can be combined into a two-column 


* See Johnson (1961) and Exercise 4.4. 
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matrix S$ or a three-column matrix C which contains all information 
requisite to the ranking operation. More generally, a collection of mat- 
rices F, V. etc., can be used, each representing some subset of the given key 
set R. 

The method of scanning the equivalence classes defined by a given key 
transformation is largely determined by the type of matrix or matrices used 
to represent the key set k. The five major methods of practical interest are 
enumerated below. Each is illustrated in Fig. 4.9. A l-origin key trans- 
formation ¢ is assumed, 

(a) Overflow. A two-column matrix F of column dimension s = d, 
represents the first elements of the equivalence classes (and their ranks), as 
follows: 

FY = ey, j W014), 


Fy =o,i¢d. 


All remaining elements of the sets (i.e., a'/e’) are represented in arbitrary 
order in a two-column “overflow” matrix V. 

The scan procedure is given by the Program of Fig. 4.9a. If the given 
argument wx is not equal to F\“”, then the overflow matrix V is scanned in 
ascending order. The left-pointing exit indicates that x ¢ R. 

For a uniform distribution, the expected scan length is clearly given by 


/= 1+ (uV)+ 1) x «(V)) = 2r(k), 


where u(V) = »(k) — v(d). The expected scan length is therefore large 
unless the average dimension of the equivalence classes (that is, (Rk) + +(d)) 
is close to unity. For a known nonuniform distribution, the expected scan 
can be reduced by placing the most frequent element of each equivalence 
class in F and ordering the elements in V according to their frequency. 
(b) Overflow with chaining. The two-column matrices F and V used in 
method (a) can each be augmented by a third column chaining vector 
which chains each equivalence class. Thus F%* is the row index in V of 
element e,* if it exists, and is otherwise null. Similarly, if V," = e,’, then 
V," is the row index in V of e’., if it exists, and is otherwise null. The 
program 1s given in Fig. 4.96. The expected scan Jength for a uniform 
distribution can, as shown earlter, be expressed in terms of the coalescence 


vector c as follows: 
i 
p= (1454) 22, 


cle 


(c) Single table with chaining. In the overflow methods [(a) and (b)]. 
certain rows of the matrix F go unused, and a saving in storage can be 


k = (Sunday, Monday, Tuesday, Wednesday, Thursday, ] Friday 6 J cam | 
Friday, Saturday) 2 Sunday | I< t(x) 
3 F= | Tuesday 3 a ; 
(R;) = 1 + (6, 7,), where 4 ° ° a: Fy, 
5 ° ° 
n = (19, 13, 20, 23, 20, 6, 19) 6 Wednesday 4 [oF 
(n, is the rank in the alphabet of the first letter of R,) —» i<0 
1 Monday 2 _ 
z = (2, 2, 3, 6, 3, 1, 2), where z; = 7(R,), 2  W=/| Thursday 5 <j f: WV) 
3 Saturday 7 
d = (1, 2, 3, 6), and s = 6. ixi+l1 
x Vii 
Data of examples Overflow 
(a) iat 
1 Friday 6 6 ] Friday 6 0 
2 Sunday 1 1 —) i< tx) 2 Sunday 14 — + i<t(z) 
3 F == | Tuesday 3 2 y 3 Tuesday 3.5 
4 ° o 6 742: F, 4. T= | Monday 27 end i: o 
5 ° oo 5 Thursday 5 0 
6 Wednesday 4 © J<Fi b> 6 Wednesday 4 © =l2r: T, 
7 Saturday To 
(4 i<-F,t i<T,? 
1 Monday 2 3 _ 
2 V= | Thursday 5 © —j,i:o __, jot, 
3 Saturday To _ 
—ia: V;t 
Single table with chaining 
Overflow with chaining i<- Vt (©) 
(6) 
J <— Vit }— 
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TP¥§ 
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Sunday 1 2 ; 1 Friday 6 
Monday 2 7 — ite) 2 Sunday 1 
Tuesday 3.5 ; 3 Monday 2 
T | Wednesday 4 © i<m, 4 T = | Tuesday 3 
Thursday 5 © _ 5 Thursday 5 
Friday 6 6 <— i ie 6 Wednesday 4 
Saturday 7 © _ ; 7 Saturday 7 
x: Ty 
m= (6,1, 3,54) i<Ty 
Sf TE 
Single table with chaining and Open addressing system- 
mapping vector construction and use of 
(d) table 
(e) 


Figure 4.9 Programs and examples for methods of scanning equivalence classes defined by a 1-origin key transformation ¢ 
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effected by combining the three-column matrices F and V in a single table 
T of column dimension 


M(T) = max (s, v(R)), where s = d, = w(F). 


Let wu be a logical vector such that u/u(u(F)) = d. Then u//F constitutes 
the unused and u//F the used rows of F. Let v be a vector of dimension 
“(T) obtained by appending a zero suffix to u. The first two columns of 
T are then defined as follows: 


v/f(a"/T) = u/|(a?/F), 
v/[(a?/T) = a?/(V?), 


where p > u(u(")) and +(p) = +/v. (The vector p permits an arbitrary 
reordering of the rows of V). The third column of T is a chaining vector 
which chains each of the equivalence classes e’. 

The appropriate scan program (Fig. 4.9¢) is similar to that of method 
(b), and the expected scan length is identical. The serious disadvantage of 
the method lies in the construction of the matrix T—all of the rows 
(specified by v) required for the leading elements of the equivalence classes 
must be known before any of the nonleading elements can be allocated. 
The table T is therefore usually constructed in two passes over the given 
key set. Moreover, any addition to, or change in, the active key set k 
which introduces a new equivalence class may occasion reallocation of 
some row of T. 

(d) Single table with chaining and mapping vector. The main deficiency 
remarked in method (c) is occasioned by the fixed vector v and the fixed 
order of the rows of v//T, both imposed by the given key transformation ¢. 
The difficulty can be shifted from the matrix T to a mapping vector m 
which is used (as indicated in the program of Fig. 4.9d) to effect a further 
transformation of the index / = f(x). The rows of T may then be arranged 
in any desired order, provided only that for each heU(o(d)), m; = j, 
where e,;" = k, = T,’, and r(e,") =i. Moreover, if T, = k, then the 
ranking vector T., may be omitted. 

Except for the extra step occasioned by the operation j< m,, the 
expected scan length is again the same as for method (b). However, the 
requirement that “(T) = max (s, »(R)) may now be relaxed to the form 
MCT) = »(R), whereas v(m) must equal or exceed s. Since the squared 
length of the coalescence vector (that is, ¢ ~ ¢) can, in general, be reduced 
by increasing the spread s of the transformation /, the expected scan length 
can now be reduced at the cost of increasing the dimension of the mapping 
vector m rather than at the (usually much higher) cost of increasing the 
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column dimension of T. A similar advantage can be gained by employing 
the mapping vector m in methods (a), (b), and (c). 

(e) Open addressing system.* The open addressing system employs a 
single table T but requires neither the chaining vector T, nor the mapping 
vector m. As shown by the program of Fig. 4.9e, each argument x is 
obtained by a forward scan of T,, beginning at component Ti". Since 
the scan is cyclic, it is necessarily successful. It can also be made fairly 
efficient by constructing T as follows. The matrix is first specified as a 
matrix of nulls. The elements of k are then assigned in order, element k, 
being assigned to the first unassigned row following row t(k,) — I. 

The program of Fig. 4.9e describes both the construction and the use of 
the table T. The branch on step 3 can occur only if the element 2 has not 
yet been entered in the matrix T, and steps 5 and 6 then complete its entry 
and the specification of the corresponding component of the ranking 
vector T,. The use of T can, in fact, proceed concurrently with its con- 
struction, L.e., each argument x presented to the program defines a new 
entry in T if it has not previously occurred, the k index of w being deter- 
mined by some algorithm independent of T. 

If the active argument set & is not fixed, it may be desired either to add 
new elements or to respecify the rank of some element already defined in 
T. Respecification may be incorporated by allowing the scan used in 
defining an entry in T to terminate on encountering either the null element 
or the argument itself. Although respecification of an entry may be 
allowed, defetion of an entry and its replacement by the null element cannot, 
for the occurrence of such an inserted null element between the beginning 
point i = ¢(k,) and the point at which k, is entered in T would later cause 
an erroneous indication that k, was not defined in T. Replacement of a 
deleted entry by a special ‘deletion character’ distinct from the null 
element could, however, be used. 

The expected scan length in the open addressing system exceeds that for 
method (d), since the expected length of scan of each equivalence class is 
increased by the potential interleaving of elements from different equiva- 
lence classes. Thus, in the example of Fig. 4.9e, the expected scan lengths 
for each of the equivalence classes (Sunday, Monday, Saturday), (Tuesday, 
Thursday), (Wednesday), and (Friday) are (1 + 2 + 6)/3, (2 + 3)/2, I, 
and I, respectively, yielding an over-all expected scan length of 16/7. The 
corresponding scan lengths for a chained scan (e.g., method (d)) are 
(1 + 2 + 3)/3, (1 + 2)/2, | and 1, with an over-all expected scan length of 
11/7. However, since it uses a fixed scan, the open addressing system is 
better suited to a serial store than is the chained system. 


* The open addressing system appears to have been first used by A. L. Samuel, G. M. 
Amdahl, and E. Boehm in constructing address tables for an assembly program. 
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If the derived keys are uniformly distributed in the range | to “(T) 
then, as shown by Schay and Spruth (1961), the expected scan length is 
1+ p + 2(1 — p), where p = 2(k) + wT). For a nonuniform distribu- 
tion, the expected scan length can be reduced by allocating the most 
frequent elements first, i.e., by defining 7 from the set A reordered in 
descending order on frequency. 


Bucket files. \n certain files the locations divide naturally into blocks or 
buckets of n successive locations each, such that the entire contents of any 
bucket can be scanned in virtually the same time required to scan any one 
location in the bucket. Such a file is called a bucket file (Peterson, 1957). 
Ina magnetic disc file, for example, each track forms a bucket. Each of the 
foregoing methods of scanning equivalence classes can be adapted to suit 
the characteristics of a bucket file. The equivalence classes can be 
grouped in buckets, with chaining provided only from bucket to bucket. 


Clustering.* The active argument sets of interest may be relatively small 
subsets of the complete set k. Moreover, their elements commonly share 
some characteristic so that a key transformation which gives uniform 
coalescence and uniform spacing of the derived keys with respect to k 
may yield highly nonuniform coalescence or nonuniform spacing, or both, 
with respect to a given active domain x. This effect is called c/ustering. If, 
for example, each element of & is represented by a vector of decimal digits 
of dimension ten, then the key transformation 


(x) = [(10e) | e(x)) + 10°] 


yields a mapping onto the range .°(10*) which has both uniform coalescence 
and uniform spacing. On the active domain *, whose elements are all 
represented by vectors g(x) such that «?/p(x) = (2,4, 7), however, all 
elements “‘cluster” in the single derived key 247, 

The deleterious effects of such correlations among elements of the 
active domain can be reduced by employing key transformations which 
depend on all components of the representation and do so in a manner 
which shows no systematic relationship to the structure of the representa- 
tion. The mid-square method, for example, consists in squaring the given 
key and extracting the middle digits of the resulting product. A commonly 
used transformation is the taking of residues modulo some number n 
such that m => +(x) and is either prime or contains few factors. 


* Note added in proof: M. Hanan and F. P. Palermo offer an important solution 
to clustering by the application of Bose-Chaudhuri codes. R.T. Chien and C. V. 
Freiman have remarked a similar application of Fire codes (private communications). 


§4.3 Multiple keys 155 


4.33 MULTIPLE KEYS* 


If in some mapping operation the access to both the key set and the set 
of correspondents is serial (or partially serial), considerable advantage can 
be gained by replacing the vector T, of ranks by the suitably reordered set 
of correspondents, that is, T,’ becomes the correspondent of the key T,’. 
For, the ranking operation on the argument k, which gives access to the 
element T,’ = k, also gives immediate access to the correspondent T,' in 
the same row T’. This is equivalent to eliminating the permutation 
operation (through reordering of the set of correspondents) and coalescing 
the ranking and selection phases so that together they require a single 
access to the (partially) serial memory. 

For a single functional correspondence, the coalescing of the ranking 
and selection phases can (by a suitable ordering of T) be accomplished by 
the single-table process (Fig. 4.9c) without introducing the mapping 
vector m of process (d). Frequently, however, a number of related func- 
tional correspondences must be provided between pairs of a family of 
vectors Q, so ordered that Q," corresponds to Q;* for all i,j, and k. Inan 
accounting system, for example, Q;, Q,, Qs, and Q, might be, respectively, 
the vector of account numbers, names, addresses, and balances in a given 
ledger. Those vectors which may occur as arguments in a mapping process 
are called key vectors: those which never occur as arguments are called 
satellite vectors. 

Q may be reordered (and augmented by a suitable chaining vector) so as 
to permit the use of the program of Fig. 4.9c for some selected key set Q,. 
However, for any other key set Q,, the order will, in general, be unsuitable. 
The program of Fig. 4.9¢ may, however, be used together with an appro- 
priate mapping vector m/ and chaining vector q’. For the sake of uni- 
formity and the advantage of allowing an arbitrary ordering for Q, the 
distinguished key set Q; may also be provided with a mapping vector m!’ 
and treated like the rest. 

The generalized binary search of Program 4.4 can be applied to the case 
of multiple keys by providing a pair of chaining vectors (M, and M3) for 
each key. The open addressing system is clearly unsuited to multiple keys. 
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EXERCISES 


4.1 Give a formal proof of the fact that the binary search of Program 4.3 will 
not work properly if steps 5 and 6 are replaced by the statements & <~ j and i <- jf, 
respectively. 


4.2 (a) The argument &; in the set of keys R = (R,, Ry, ..., R,) occurs with the 
relative unnormalized frequency /. For the case »(k) = 10, design the 
matrix M which will minimize the expected scan length when applying 
Program 4.4. 

(b) Show how additional keys may be incorporated in the system of part (a) 
without revising the entire matrix M, Discuss the effects on the expected 
scan length. 


4.3. Consider the ledger L defined as 


3125;ADAMS, S. H.°°/435°ASHee29900 
O168;/BAKER, J. C.°9° |76°ELMeececsnvs 
7926)/FOX, R. L.cc°e |435°LAUREL?2°° 
3420;FOX, R. L.cc°° |435°LAUREL?®°°2 
1925|/HILL, K.eccce LL8°LINDEN?2:ee%e 
2486|/JONES, J. C.vo J6L°MAPLE°?®?°° 2% 
9127/JONES, J. C.ce [73 6°LELNDEN??2%? 
6135/KING, K. M.°°° /76°ELM>eeese0e00 


and the argument domains k!, k?, and k?, consisting of all 4-digit decimal numbers 
(account numbers), all 12-letter names (in capitals, with null fill), and all 14- 
character addresses (alphanumeric with null fill), respectively, and let the rows of 
at/L, (4 | a)/L, and w'4/L represent the corresponding active domains x1, x®, 
and x3, 

(a) Specify a simple key transformation on the set k! whose range lies in the 
set U((L)), which, when applied to the active key set x’, yields a derived 
key set of dimension three or greater. 

(b) Reorder the ledger L and add a chaining vector to chain the equivalence 
classes so that the resulting matrix M may be used with the key trans- 
formation of part (a) and a program of the type of Program 4.9c. Show 
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the resulting matrix © and the specific program used for selecting the row 
M'’ determined by an argument a x'. 

(c) Specify simple key transformations on cach of the sets Rk? and k? which 
yield derived sets of dimension not less than three when applied to the 
active domains x? and x°, respectively. 

(d) Augment the matrix M of part (b) by a permutation vector and a chaining 
vector suited to cach of the key transformations of part (c). 

(e) Write a program which selects, as a function of a and j/, the row of M 
corresponding to the argument a ¢ x’, for j = 1, 2, or 3. 

44 Lett; = 1(k;), where fis a key transformation such that f ¢ LO(R)). The 
vector ft takes on nv” distinct values (where # = »(R)), which are assumed to be 
equiprobable. For nm = 2, the cases are (1,1), (1,2), (2.1), and (2, 2), with 
expected scan lengths [, 3) t, and 2. 

(a) Show that the over-all expected scan length is 1.25 for n = 2. 

(b) Calculate the expected sean lengths for a = 3 and for” = 4, 

(c) Generalize the result of part (b) to show that the expected scan length 
rapidly approaches 1.5. [See Johnson (1961) for an alternative derivation.] 


4.5 Design an open addressing system for the ledger L of Exercise 4.3 and the 
key transformation of part (a). 

4.6 Program and discuss the extension of binary search to m-way search. 

4.7 Let J; be the time required to scan over the th element of a vector * which 
is represented in a chained (or other serially-scanned) representation, and let f; 
be the frequency of occurrence of the argument *x;. Discuss the role of the 
“standardized frequency” s = f + / in determining the optimum ordering of the 
vector x. 

4.8 The neighbors problem requires that the near neighbors of an object in 
n-dimensional space be found. The technique used depends on the dimensional- 
ity and the particular criteria of adjacency. 

(a) The position of a vehicle on a turnpike is represented by the distance in 
miles of the vehicle from the south end, and p,; is the coordinate of the ith 
patrol car. Write programs to: 

(i) identify the patrol car nearest an accident at position a, 
(1i) identify the two cars nearest to each other, 

(b) A three-column matrix V specifies the locations and radio call-signs of a 
fleet of merchant vessels on a flat lake, where V,' is the call-sign of the /th 
vessel, V,' is its distance in miles from the meridian tangent to the lake on 
the west, V,' is its distance in miles from the perpendicular tangent to the 
Jake on the south. Write a program to determine the call-sign of the neigh- 
bor nearest to a distressed vessel whose call-sign c is given. 

(c) The matrix of part (b) is used to specify call-signs and locations at time ¢ 
of a fleet of bombers over a flat territory. When each bomb ts released, 
neighboring planes must be at a safe distance. Construct a program which 
will find the call-signs of all pairs of planes within r miles of cach other at 
time f. 

(d) In a certain hydrodynamic calculation, the motion of each elementary 
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volume of fluid is traced. The ith elementary volume is described by row Li 
of a matrix: (4 | a)/L represents the three space coordinates (in a recti- 
linear system), and «7/L represents the remaining parameters. At each 
time step the parameters of volume / are redetermined by those of the four 
elements nearest it. Write a program to determine «4/L? as the set of 
indices of the four nearest neighbors of element i. Hint. Attach an explicit 
index vector before sorting. 


chapter 5 


METAPROGRAMS 


It is frequently necessary to treat a program as an argument of some 
process, as in the systematic analysis of a faulty program or in the trans- 
lation of a program expressed in one language to an equivalent program 
expressed in a second language. Such a process defined on programs may 
itself be formalized as a mefaprogram. 

Formally, a metaprogram is defined as a program whose domain is a set 
of programs, each element of the domain being called an argument 
program. If the range of a metaprogram is also a set of programs, the 
metaprogram Is called a trans/ator. An element of the range of a translator 
is called a function program; Le. a translator operates on argument 
programs to produce function programs. A metaprogram whose range Is 
not a set of programs is called an analyzer. Thus an analyzer produces, not 
a function program, but data useful in the analysis or application of the 
argument program. If, for example, the instructions of a computer 
program are sorted on the data address and listed, the list brings together 
all data references to each register used and therefore facilitates analysis of 
the (possibly conflicting) uses of registers in the program. A metaprogram 
which schedules and directs the execution of other programs (and meta- 
programs) is called a director or supervisor. 

Four main types of translator are distinguished: compilers, assemblers, 
generators, and interpreters. A compiler accepts programs expressed in a 
given language (the argument language) and produces corresponding 
programs expressed in a second language (the function language). 

An assembler is a special case of a compiler which is limited as follows: 
(1) the statements of the argument program are virtually independent and 
may therefore be treated one at a time, and (2) the statements of the 
argument program are simple (not compound) and need not be analyzed 
into component statements. There usually exists a fixed correspondence 
between the operations of the argument program and those of the function 
program; the translation thus consists essentially of a substitution of 
symbols for the operations and/or operands. 

A generator produces, by specialization, any one of a family of function 
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programs. Thus a single generator might, for each specified value of a, 
produce a specialized program for evaluating the function 2”. The 
argument of a generator is usually viewed as two distinct parts: the 
skeleton program, which determines the family of potential function 
programs, and the specification, which determines the particular member 
of the family produced. Generators are frequently incorporated in 
compilers. 

A translator is called an interpreter if it (1) executes the segment of 
function program corresponding to a statement of the argument program 
immediately after it is produced, and (2) selects the statements of the 
argument program in a sequence determined by the execution of the 
function program. The function program itself is normally treated as an 
intermediate result, and only the outputs of the executed argument program 
are retained. The execution of an interpreter can therefore be viewed as 
follows: each statement of the argument program is first “interpreted” 
in the function language and is then executed. 

The trace program and the utility program are special cases of the inter- 
preter. The former executes the argument program without modification 
but produces, in addition to the normal outputs of the argument program, 
a trace of the argument program listing each instruction executed and the 
intermediate results produced. In a narrow sense, a utility program ts an 
interpreter whose argument program is supplied directly by a computer 
operator via the control console. More broadly, the term is used to denote 
any program frequently used by the computer operator. 

The present discussion of metaprograms will be limited to one important 
aspect of compilers, the analysis of compound statements occurring in the 
argument program. The treatment embraces the translation between the 
common parenthesis notation and the Lukasiewicz (1951) notation which 
proves most convenient in the analysis of compound statements. 


5.1 COMPOUND STATEMENTS 


Each statement of a program can be considered as an operator which 
maps the given operands into the specified result. If the operator corre- 
sponding to a given statement belongs to a given set of operators p, the 
Statement is said to be elementary in p. A finite program whose operators 
all belong to a set p is called a program in p. A statement which is not 
elementary in p but which can be expressed as a finite program in p is said 
to be compound in p. The analysis in p of a compound statement is the 
specification of a corresponding finite program in p. 

For example, the statement 


z<(*@+y)xrt+(s—)" 
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is compound in the operator set 
p = (addition, subtraction, multiplication, exponentiation), 


and Program 5.1a shows one possible analysis in p. Program 5.1b shows a 
similar analysis in the set 


g = (addition, subtraction, multiplication, branch). 


A metaprogram that translates all statements which are elementary in a 
set of operations p can be extended to translate statements which are 


—, a«+av+y am er ear ey 
b+axr b<axr 
c«~s-—t CxS Se 
d+" d<1 
z<-b+d > i<n 

Sj) iei-| 
d<«cxd a 
2<b+d - > 
Analysis in p Analysis in q 
(a) (d) 


Program 5.1 Analysis of the compound statement 2 <-(r + y) xr + (5 — 1)" 


compound in p by the addition ofa metaprogram for analyzing compound 
statements. The conventions adopted for the representation of compound 
statements must, of course, be complete and precise so that interpretation 
is unequivocal. These conventions should be familiar and convenient to 
the programmer and should also permit easy analysis by a metaprogram. 
The common parenthesis notation of elementary algebra is congenial to 
programmers, whereas statements in Lukasiewicz notation are easier to 
translate and evaluate, easier to transform to an optimum form which 
minimizes the amount of intermediate data storage and execution time 
required in their evaluation or translation, and possess the simple criterion 
for well formation developed for the left list matrix of a tree in Sec. 1.23. 
The analysis of compound statements will therefore be discussed in terms 
of Lukasiewicz notation, and algorithms for translating between the 
parenthesis and Lukasiewicz notations will be presented. 
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Figure 5.2. Representations of a compound statement 
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Figure 5.2 shows several alternative representations of a compound 
statement. The tree representation (5.2a) is perhaps the most graphic, and 
the other forms may be viewed as alternative representations of the tree. 
The common parenthesis form of Fig. 5.2e, for example, specifies the 
requisite structure primarily by grouping the nodes of each subtree within 
parentheses with the root in the middle of the group. As shown in Sec. 
3.4, the left list matrix of Fig. 5.2¢ can be supplanted by the simpler left 
list vector of Fig. 5.2¢, providing that the degree of each operator p is a 
known function 0p). 

The left list vector notation for a compound statement is also called 
Lukasiewicz,* Polish, or parenthesis-free notation. The Lukasiewicz and 
the parenthesis notations will hereafter be referred to as “-notation and 
.P-notation, respectively. 


5.2 LUKASIEWICZ NOTATION 


Although “-notation can be viewed as the left list vector of a tree 
representing a compound statement, it is helpful to develop an alternative 
equivalent formulation as follows. Let fand p be two disjoint sets whose 
elements are literals and operators, respectively, and whose union v = 
IG) p is called a vocabulary. A strictly posttive integral degree function 
a(p) is defined on each element of p, and each operator p of degree d 
accepts d elements of ! as operands to define a result or value in £ In 
symbolic logic, for example, f = (0.1), p= (A.V. ), OCA) = OCV) = 
2,and o(” ) = 1. Consistent with these notions, the degree of each literal 
is defined to be zero.t 

Each operator p of degree »(q) defines a function (1.¢e., a mapping) from 
each vector q ¢ / into an element y ¢f. This function is denoted by the 
vector f = (p) ©) q. The vector f is called an 4-phrase of length »(f), and 
the element y is called its ra/ue. Table 5.3 shows, for example, the “- 
phrases in the system for symbolic logic based on and, er, and not, The 
vector f is clearly contained in the vocabulary v, that is, fc v. Where its 
omission raises no ambiguity, the prefix will be dropped from the term 
“¢-phrase” and from similar terms to be defined. 

A vector 2 © v is called an 4-formula of length (2%). In particular, 
every phrase isa formula. The degree vector of a formula 2 will be denoted 
by 0(z) and defined by the relation (0(z)), = 0(2,). 

* First proposed by Lukasiewicz (1951) and first analyzed by Burks et al. (1954). 

+ The system is extended to include variables as well as literals by considering the 
vocabulary vox: I. p, where x ts the set of variables, and *¥ OCU» p) = e(0). 
The degree of each variable is, like that of a literal, defined as zero. The domain of the 
operators is still confined to the set £, and in any legitimate algorithm each variable is 
specified as a literal by some statement before it enters as an argument. 
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Phrase Value 


(Vv, 0,0) 
(Vv, 9, 1) 
(Vv, 1,0) 
(v,1,) 
(A, 0, 9) 
(A,9, 1) 
(A, 1,9) 
(A, 1,0) 
(~,9) 
(~,1) 


Table 5.3. Phrases in “-system for symbolic 
logic based on operators and, or, not 


If some infix of 2 is a phrase, and if the infix is replaced by its value, then 
the resulting vector y ts called an #-reduction of z. If jis any “-reduction 
of z which cannot be further reduced, it is called a complete reduction of 
or, since it can be shown to be unique, fhe complete reduction of z. Com- 
plete reduction of z will be denoted by “(z). A formula 2 is said to be 
singular* if its complete reduction is a single literal, i.e., if (7(z)) = | and 
L(zjel. Thus q =(A,1, V,0,1) and r= (1) are singular, but s = 
(A, 1, 1,0) and t = (A) are not. 

For example, complete reduction of the singular formula z = (A, -/, 1, 
A,9,1, Vv, 1, —, 1) may be performed as follows: 


s=(A, V; 1, A,9, 1, V1, 15) 
gia (A, v,1, A, 90, 1, Vv, 1,0) 


a (A, V,1,A,0,1, 1) 

eG= (A, V, 1,0, 1) 

sts (A,1,1) 
—._—" 

= 1 


Program 5.4 shows the complete reduction of a formula z, including tests 
of singularity. 

Program 5.4. The components of the given formula x are examined in reverse 
order and assembled into a stack vector y = (%;, %;,4,..-, %,), where 2; is the 


* The term well formed used by Burks et al. (1954) and others is avoided here because 
singularity implies not only that the formula represents a well formed tree but also that 
the tree is singular. 


§5.2 Lukasiewicz notation 165 


1 s<((y) = 1) l-origin indexing 
> — 4 — 
~ s<0 z | Given formula. 
3 y — (0) y | Reduced suffix of 2. 
4 i<r(z) +1 i | Index of 2 in descending scan. 
5 j<G-1 = j | Dimension of phrase to be 
reduced. 
6 yr E)OY s | Singularity indicator. 
7 }={ 60) : 0 
8 j-5y) +1 Legend 
. > 
9 ity) ne 
10 va £(a)/y) 
1] CH} y-Q@) O@ly 


Program 5.4 Evaluation of the formula z 


component currently examined. When an operator (1.e., a node of nonzero de- 
gree) is first encountered, the prefix of y forms a phrase of dimension j = 0(y,) + 
1, which is immediately reduced (i.e., evaluated) on step 10 and is then replaced 
by its reduced value on step If. Singularity of the vector 2 is indicated by a non- 
zero value of s, which is set to one only if the exit occurs from step | with r(y) = |. 
The case »(y) <- 1 can occur if the formula represents a well formed but nonsingu- 
lar tree, .e., if the formula contains two or more singular formulas. The exit 
from step 9 occurs if the indicated dimension of any phrase exceeds the current 
dimension of y and leaves the indicator s at its initial zero value. 


The singular formulas are clearly the meaningful compound statements 
in the system. Moreover, if L, is a singular formula and if L, = 0(L,), 
then E is the left list of a singular tree. The singularity of a given formula 
z can therefore be determined from its associated degree vector d = 0(2). 
The necessary and sufficient condition for singularity of the associated tree 
is simply »(d@) — (+/d) = 1. As shown in Sec. 1.23, the necessary and 
sufficient condition for well formation is that all components of the suffix 
dispersion vector s defined by* 


s=(1+ TD) i (e—4) 
must be strictly positive. The maximum over the components of s will be 
called the maximum suffix dispersion of 2. 


* The suffix dispersion vector describes the dispersion (number of roots) of all suffixes 
of Z, as may be more easily seen from the alternative expression 


S$, = (ai/d) — (+/a/d). 
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5.3 THE MINIMAX FORM OF AN -FORMULA 


Two formulas are said to be equivalent if they have the same value for 
each possible specification of their variables. If z is any formula whose 
operators are all symmetric (i.e., whose operands commute), then any 
reordering of the component singular formulas of z which leaves the span 
of each operator unchanged leads to an equivalent formula. For example, 
since the operators A and vy are symmetric, the formulas 


Z=(A,V,AL VW, ASU 1, 9g, P) 
and G=(AVP VG Ast VOW A,B) 


are equivalent, as may be easily verified. In the tree representation this 
reordering appears as a reordering of the group of subtrees rooted in a 
common node. 

A formula whose maximum suffix dispersion is minimal with respect to 
the set of all equivalent formulas is said to be in minimax form. 

The dimension of the stack vector y employed in the evaluation of a 
formula 2 (cf. Program 5.4) takes on successive values equal to the number 
of roots in the tree represented by the suffix of = currently scanned. It 
therefore assumes the values (in reverse order) of the components of the 
associated suffix dispersion vector s. The maximum dimension of the 
stack vector determines, in turn, the amount of auxiliary storage required 
in the evaluation of the formula or in the compilation of a function 
program for its evaluation. It also determines, in part, the number of 
transfers of intermediate results to and from storage in evaluating the 
formula in a computer having a limited number of central registers. A 
formula in minimax form minimizes the maximum dimension of the 
stack vector and is therefore to be preferred. 

The transformation of a singular formula z to equivalent minimax form 
is based on the following theorem: if each of the 6(z,) component singular 
formulas of @!/z is in minimax form, then the entire formula can be 
brought to minimax form by arranging the component singular formulas 
in ascending order on their maximum suffix dispersion. 

For example, if z= (A, V,A,4,6,A,¢6,d, V,e,f), then 6(%) = 2, 
and a'/z contains two singular formulas, y! = (v, A,a@,6, A,¢,d), and 
y?=(v,e,f), each in minimax form and possessing maximum suffix 
dispersions of 3 and 2, respectively. Moreover, 


q=@) OP Oy =(A, V.6f, Vi, A,G4 A,6 4) 


is an equivalent formula in minimax form, with a maximum suffix dis- 
persion of 3 as compared with a value of 4 for the same function of z. 
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To establish the theorem, let z be any singular formula, let r(z) be its 
dispersion (that is, the number of roots), Iet s(z) be the maximum suffix 
dispersion of z, let ¢ = 6(%,), and let 


assy oy Cy! 
be the unique (cf. Program 3.9) partitioning of a!/z into its component 
singular formulas. Then y/ @ oy Vays) yl t represents a k-tuply 
rooted tree and r(yi GQ yl © - Oy’ A) = k. Moreover, 
s(a!/z) = max [s(y}) trove Gy), ser awror- Oyy] 
= max [s(y') + d _ 1, s(y" O y my wae D y)] 


= max [s(y') +d — 1, s(W*) + d— 2, s(yP Core yD] 


= max [s(’?) + d — FJ 
g=1 


=dt max [s(y’?) — J]. 


j=] 


Since the component formulas are in minimax form, the s(y’) are individ- 
ually minimal, and the maximum over s(y’) — j is clearly minimized by 
arranging the s(y/) in ascending order. This concludes the proof. 

To ensure that each component formula is itself optimal, it suffices to 
apply this reordering procedure in turn to the successive singular formulas 
encountered in scanning the given formula from right to left, as shown in 
Program 5.5. 


Program 5.5, The vector y is the suffix a’/z permuted to optimal form, p is its 
partition vector* (that is, ((p 5 « 1) | «)/y is the jth singular formula of 9), and 
g is its maximum suffix dispersion vector (that is, g; is the maximum suffix dis- 
persion of the jth singular formula of ¥). The main control paramcter / is decre- 
mented on step 11, and, if it is not zero, the degree d = 0(z,) of the next 
component to be added to y is examined. If 2; is not an operator, the branch 
to step 8 occurs with A = 1. The component 2; is then a formula of length 1 
and steps 8-10 add it to y and make the appropriate changes in p and g. 

If z; is an operator (of degree d), the loop 15--22 scans the vector g and reorders 
the first d component formulas of » so that their maximum suffix dispersions are 
brought to ascending order. This is accomplished by the simple, but not necessar- 
ily cfficient, sorting process of comparing successive pairs of adjacent components 
of g and interchanging the corresponding component formulas of y (by rotation 


* The conventions used for p are those established in the subtree partitioning of 
Program 3.9. 
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Program 5.5 Transformation of the formula z to minimax form 


of the infix representing the pair) if an interchange is required. Steps 21 and 22 
effect the corresponding interchanges in the vectors p and g. The Joop is termi- 
nated by the branch from step 17 to step 6, the first ¢ formulas of y (forming the 
prefix a”/y, where 7 = ( +/a"/p)) are then in ascending order on their maximum 


suffix dispersions, and the new formula (2;) 


a”/y is therefore in optimal form. 


Its maximum suffix dispersion is computed by steps 6 and 7 and replaces the 
prefix a“/g (step 8) so that g becomes the maximum suffix dispersion vector of 
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(z,) © y. The partition vector p is respecified by step 9. The one component of g 
remaining at the conclusion is the maximum suffix dispersion of the optimized 
statement y. 


The minimax form of a formula is, in general, not unique. I[t can be 
made unique, however, by using some assigned orderings of the operators, 
literals, and variables (e.g., the ordering specified by the vocabulary 
v= *C)EC) p) as a minor category in the reordering of component 
formulas. Such a unique form is helpful in detecting the occurrence of 
equivalent compound statements within a formula, with the aim of 
obviating repeated segments in a corresponding function program. 


5.4 TRANSLATION FROM COMPLETE 
PARENTHESIS TO LUKASIEWICZ 
NOTATION 


Ordinary parenthesis notation is complicated by the occasional or 
consistent use of certain conventions for eliding parentheses. For example, 
the expression 

(x + (y X 2)) 
may also be written as 
(@@ ty x2) 


by the convention that multiplication takes precedence over addition, or as 
x+y xX 2, 


with the understanding that the entire expression need not be enclosed in 
parentheses. 

The problem posed by the use of such conventions can be segregated by 
considering a complete parenthesis notation in which all implied paren- 
theses are included, i.e., in which each operator and its associated operands 
are enclosed in parentheses. The analysis of a statement in parenthesis 
notation can therefore be performed in two steps, a translation to com- 
plete parenthesis notation according to the prescribed conventions, fol- 
lowed by the analysis of the resulting statement. The present discussion 
will be limited to expressions in complete parenthesis form. 

The complete parenthesis notation will be referred to as -?-notation and 
the terminology adopted for “%-notation will be extended analogously. 
Thus, z=([,Le,+.4,], X*.7.]) is a -*#formula more commonly 
denoted by (« + y) x r. To avoid confusion with the normal use of 
parentheses (e.g., in enclosing vectors), brackets will be used (as in the 
foregoing vector 2) to represent the .7-notation. 
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The discussion will be limited to a system of unary and binary operations 
only (1.e., 0(p) = | or 2) and will again be illustrated by the system of 
logical operations. Assuming a vocabulary of the form v = x (I@ pd 
({, ]), the rules of composition for A-notation may be formulated as 
follows: 


lL. Ifeex OL, then e is a singular A-formula. 

2. If z is any singular formula, if we p, and if (uv) = 1, then (J) @ (vy) @ 
z (J) is a singular -7-formula. 

3. If y and z are both singular formulas, if 6 p, and if 0(6) = 2, 
then (1) Gy © (6) © z © ()) is a singular -7-formula. 


In particular, ([, ¥, A, 2, }) is singular but (y, A, 2) is not; (y) is singular 
but ([, y. ]) is not, and ([, —, y, ]) is singular but (, y) is not. 


1 — 7(®) < € 1-origin indexing 

— Ae, 
2 re ®, | Input in #-notation (terminated by °, Ag). 
3, 4e:1 ®, | Output in “-notation (reversed in order). 
4 wr] = ®, | Auxiliary stack file. 

Set of operators. 
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Program 5.6 Translation from complete parenthesis to Lukasiewicz notation 


The translation from -7- to #-notation can be performed with the aid 
of one auxiliary file or stack vector. Program 5.6 shows a suitable process 
which will correctly translate any singular formula, but which includes no 
tests for singularity. It is noteworthy that all left parentheses are simply 
ignored. A similar process can, of course, be designed to use only the left 
parentheses and to ignore all right parentheses.* Any translation which 
tests for singularity clearly must use all parentheses. If the “- and /A- 
notations employ different sets of operator symbols (e.g., A, Vv, , and 
xX, +,-~), the appropriate translation can easily be incorporated in the 
program. 

Program 5.6. The original statement is assumed to be recorded on a file ,, 
with partitions A, following each symbol and with a null item and partition A, at 


* Octtinger (1960) analyzes three types of parenthesis notation: /eff, right, and com- 
plete. 
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the end. File ?, receives the resulting -formula in reverse order, i.e., from right 
to Jeft. The stack file bs receives each operator symbol as it is read from ®, and 
transfers them one at a time in reverse order (i.e., by a backward read) to ?, at 
each occurrence of a right parenthesis. A trace of the program shows, for 
example, that the Y-formula ([, [.#, V.¥.], 4.6 ~ . J, }) translates correctly 
into the “-formula (7, ¥, V,2, , A) reversed from normal order. 


A partial test of singularity can be provided by testing each component 
for compatibility with its predecessor, the ordered pair being declared 
compatible if and only if it can occur in some singular formula. For 
example, an operator may be followed by either a left parenthesis or a 
variable, but not by a right parenthesis or another operator. These first- 
order compatibility constraints can be expressed in terms of the following 
classes: left parenthesis, unary operator, binary operator, variable or 
literal, and right parenthesis. These classes will be denoted by [, u, 6, v, and 
], or alternatively by 1, 2, 3, 4, and 5. The constraints are summarized in 
the matrix M of Program 5.7. 

The test of singularity provided by the first-order constraints is not 
complete,* but can be completed by the following expedient.f The 
auxiliary file which receives the operators (file ©; of Program 5.7) also 
receives the left parentheses in their turn. The following tests are then 
added: 


1. Each operator is accepted and rep/aces the previous entry in the 
auxiliary file if and only if the previous entry ts a left parenthesis. 

2. The transfer of one component from the auxiliary file to the output 
file normally occasioned by the appearance of a right parenthesis ts 
accepted by the right parenthesis if and only if the component trans- 
ferred is an operator. 

3. The possible exhaustion of the auxiliary file is tested each time it is 
read. 


The first test prevents the acceptance of two successive operators without 
an intervening left parenthesis. At each application of the test, the corre- 
sponding left parenthesis is removed from the file. Since the auxiliary file 
may now contain left parentheses as well as operators, the second test is 
required to prevent their acceptance as operators. The complete testing 
and translation process is described by Program 5.7. 


Program 5.7. The current component -« is read from the input file on step 8 and 
its class & is determined before the repetition of the main loop at step 5. Step 5 


* The tests provided in compilers have frequently been limited to essentially this type. 
See, for example, Carr (1959). 
} This procedure is due to Oettinger (1960). 
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Program 5.7 Translation from complete parenthesis to Lukasiewicz notation 
with full checking of singularity 


determines the first-order compatibility of & with its preceding value j. (The 
singularity indicator s is set to unity only at the exit on step 10.) Each component 
occasions the recording (step 6) of one item on one file—the auxiliary file ®, if 
k .: 4, or the output file ®, if & ~- 4, The item recorded is the current component 
unless it is a right parenthesis. In the latter event, the variable w is first respecified 
(step 20) by a backward read from the stack file. The test on step 21 assures that 
the item read is an operator. 

If the current component is an operator, the previous item recorded on the 
auxiliary file must first be read, compared with “[,’* and discarded. This occurs 
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on steps 15 and 16. The exits on steps 15 and 20 indicate nonsingularity duc to 
the exhaustion of the stack file. Step 9 provides a final test to ensure that the 
stack file is exhausted when the input file becomes exhausted. Since the first 
component of any singular formula must be either a variable, literal, or Icft 
parenthesis, the initial setting of j to 2 on step 4 provides the appropriate initial 
compatibility condition. 


Each of the translation programs considered produces the resulting 
¥-formula in reverse order. This is the order in which it is most easily 
evaluated and, consequently, the order in which the synthesis of a corrc- 
sponding function program is most easily performed. Synthesis may 
therefore proceed concurrently with analysis. The analysis may, on the 
other hand, be completed first, and the output file P, rewound before 
beginning the synthesis. The latter alternative allows the use of separate 
metaprograms for analysis and synthesis, and hence makes lesser demands 
for metaprogram storage. It also allows the application of an intervening 
transformation of the #-formula to some preferred equivalent form. 
However, as shown by Program 5.4, the transformation to minimax form 
also treats the “-formula in reverse order. It can therefore be performed 
concurrently with the translation from parenthesis notation. 


5.5 TRANSLATION FROM LUKASIEWICZ TO 
COMPLETE PARENTHESIS NOTATION 


The inverse translation from Lukasiewicz to complete parenthesis 
notation is, unlike the evaluation of the Lukasiewicz formula, best 
performed by a forward scan. The suffix dispersion criterion of singularity 
must then be applied in the following way. The dispersion of the entire 
statement is assumed to be one, and the dispersions of successively shorter 
suffixes are obtained by subtracting (1 — 0(r)) for each succeeding com- 
ponent «. The suffix dispersion thus computed must reach zero when and 
only when the remaining suffix is null; if not, the statement is nonsingular. 
The translation of Program 5.8 provides complete checking of singularity. 


Program 5.8. The resulting “formula is produced on file ?, in reverse order. 
Each operator encountered is recorded in the auxiliary file together with a preced- 
ing left parenthesis, and it also causes a right parenthesis to be recorded in the 
output (steps 9-11). Each variable encountered is recorded (step 12) in the output 
file and initiates a transfer from the auxiliary file to the output file which termi- 
nates (step $9) only when an operator of degree two ts encountered or (step 13) 
when the file becomes exhausted. In the latter event, steps 14 and [5 are cxecuted 
as a final check on singularity—-exhaustion of the stack file, exhaustion of 
the input file, and the first zero value of the suffix dispersion # must occur 
together. 
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Program 5.8 Translation from Lukasiewicz to complete parenthesis notation 
with complete test of singularity 
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EXERCISES 


5.1 For each of the following logical functions, exhibit an equivalent (i) tree, 
(ii) 7-formula, and (iii) “-formula: 

(a) fC, yD) =r A V2). 

(b) g(y, 2,4, 2) =(w Vy 2 2) A Ce Vv OF =). 

(c) the function of part (b), limiting the operators employed to and, or, and not. 
5.2 Let f(", 4, 2) be a Jogical function of three variables, let q be an equivalent 
formula in -?-notation, and let r be an equivalent formula in “-notation. Write 
programs to determine the intrinsic vector i( f) (cf. Sec. 7.2) as a function of 

(a) the -4-formula q. 

(b) the 4-formula r. 

5.3 Leta =(A, ,0, v, 1,0) 
b=( ,0, v, 1,0) 
c=(A, V,9, Vv. 1,9) 
€d=(A,ViGn 41) 
e=(V.g", 52) 
q=(LL4% VnbA. ot) 
r=(L¢,V.0LA.6 41D) 
Trace the operation of 

(a) Program 5.4 for each of the cases 2 = a, 2 = b, and 2 = c¢, 

(b) Program 5.5 for z = d. 

(c) Program 5.6 for ®, containing q. 

(d) Program 5.7 for ?, containing q and for @, containing r. 

(e) Program 5.8 for ®, containing d and for @, containing e. 


)) 


5.4 Write a program for translating from -¥-notation to “-notation which is 
analogous to Program 5.6 except that it ignores right rather than left parentheses, 


5.5 Write a program to extend the minimax transformation of Program 5.5 to 
the case of an operator set of the form p = p! « p*, where p' and p* are a set of 
symmetric and asymmetric operators, respectively. 

5.6 Write a program which extends the minimax transformation of Program 5.5 
to include ordering on the variables and operators so as to bring the formula to 
unique canonical form, as suggested in Sec. 5.3. 

5.7 Write a program which will recognize all identical singular subformulas 
occurring in a singular “-formula 2 and which will produce a record of the 
associations in some convenient form. 


chapter 


SORTING 


The order in which a set of items is arranged in a large-capacity store 
often has a marked effect on the simplicity and speed of execution of 
algorithms defined on them, and it therefore becomes necessary to sort or 
rearrange groups of items. 

The problem of sorting may be described as follows: given a vector a, 
determine the ordering vector p = 6/(k(a)) and the permuted vector 
c = pja, where k(a,) is a numeric function defined on the components of 
a, and k(a) is the vector defined by (k(a)); = k(a;). The function k is 
called the key of the sorting process, and k(a) is called the key vector 
associated with a. The key function is frequently an index in some set b, 
that is, k(a) = bia. The components of a will also be called items; since 
the vector a is commonly represented in a file, it will also be called a file. 

Most sorting processes determine the ordered vector c = pla without 
explicitly determining the permutation vector p. A sorting process which 
explicitly determines and uses the permutation p is called an address table 
sort. 

Sorting processes fall into two major classes, called seria/- or random- 
access, according to whether the files used to represent the original and the 
intermediate vectors produced are serial-access or not. Random-access 
processes are also called interna/, for they are normally performed in the 
“internal” storage of a computer. Combinations of serial processes and 
internal processes are used, but the two types can and will be described, 
analyzed, and evaluated independently. 


Input Output Name of Process 
Single Single Duplication 
Single Multiple Classification 
Multiple Single Merging (or Merge) 
Multiple Multiple Revision 


Table 6.1 Types of file operations 
176 
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Four types of operations on files are distinguished, according to whether 
one file or several files are used in input and in output. They are shown in 
Table 6.1. A classification (merge) involving m output (input) files is called 
an m-way classification (merge). 


6.1 SERIAL SORTING METHODS 


Copy operations 


A serial sorting process is executed as a sequence of copy operations. 
A copy operation is defined as follows: all items from a given set of input 
files are transferred to a given set of output files, and each item read from 
any input file must be transferred to some output file before a further item 
is read from the same input. 


Input Item Output 
Files Positions Files 


C)— Ls 


Figure 6.2 Copy operation 


Figure 6.2 provides a graphic description of the copy operation. The 
m components p,, Po,..., P,, denote storage for m items, the Ath input 
file may be read to the Ath item position only, and the occupant of any 
item position may be transferred to any one of the output files. Once an 
item is so transferred, the evacuated position may be refilled by the next 
item from the corresponding input file. Any copy operation can clearly be 
performed by serial input files and serial output files, with each item being 
read and recorded but once, and with no auxiliary repositioning of the files. 

A copy operation is also called rank-preserving, since it satisfies the 
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following definition of that term. [The concept of rank preservation and 
its use in the analysis of sorting processes were first introduced by 
Ashenhurst (1953).] Any item may be specified by a pair of coordinates 
(fr), the file coordinate f designating the file to which it belongs, and the 
rank coordinate r designating its rank (i.e., its index) in the file. Two items 
with initial coordinates (/;, r,) and (fy, 72) and with final coordinates (f;’, r,’) 
and (f3', 2’), respectively, are said to be relatable if and only if f; = fg and 
fi = fy’. An operation is called rank-preserving if precedence relations 
are maintained for all relatable items, i.e., 
nS rycen <r’. 

Henceforth the terms merge and classification will, unless otherwise 
specified, refer to rank-preserving merge and rank-preserving classification, 
respectively. A merge in which each input file forms an infix of the output 
file is called a simple merge, and a classification in which each output file is 
formed from an infix of the input is called a simple classification. 

An m-way classification and a subsequent m-way merge together effect a 
rearrangement from a single file to a single file. If the classification and 
merge are both rank-preserving, the possible rearrangement effected is 
restricted. However, a sequence of such orderings using alternate classi- 
fication and merge can effect an arbitrary reordering. In particular, the 
following two important subclasses of such orderings will each be shown 
to suffice: 


1. simple classification and merge, 
2. classification and simple merge. 


A sequence of copy operations of the first type used to effect complete 
ordering on some key is also referred to as a merge sort. A sequence of the 
second type is called a column sort. 


Simple classification and merge 


An infix in a file vector for which the key is a monotone increasing 
(decreasing) function of the rank is called an increasing (decreasing) string. 
The /ength of a string is the number of items it contains, and a maximal 
string is a string contained in no longer string. A file containing a single 
maximal string is ordered on the key. 

For example, the sequence of keys 


1, 3, 5, 8, 4, 7,9 
contains several increasing strings including 1, 3; 1, 3, 5, 8; and 7, 9, but 
it contains only two maximal increasing strings, 1, 3, 5, 8 and 4, 7, 9 and 


six maximal decreasing strings. Henceforth the term string will normally 
refer to a maximal string. 


§6.1 Serial sorting methods 179 


Two files, each containing one string, may be merged to produce a 
single string by selecting at each step the item with the smaller key of the 
two next available from the input files. More generally, if the inputs each 
contain # strings, and if the foregoing process is generalized to produce the 
longest possible output strings, each output string will contain precisely 
one string from each input. If the inputs contain m, and n, strings, re- 
spectively, then # = max (#1), 2.) strings are produced in the output. A 
subsequent simple two-way classification which assigns [7 + 2] strings to 
the first file and the remaining strings to the second, yields the greatest 
possible reduction in the maximum number of strings in any one file. 
Repetition of the merge and classification phases eventually produces an 
ordered file. 

The generalization of the process to an mi-way merge and #-way classi- 
fication is immediate,* the optimum number of strings assigned to each 
output file by the classification process being limited to[# + m]. Referring 
to Fig. 6.2, the m-way merge may be described as follows: those item 
positions containing keys which equal or exceed the key last recorded on 
the single output file are said to be efigib/e, and the next item chosen for 
recording is the eligible item with the minimum key. When no eligible 
items remain, all positions are again made eligible and the process 
continues, initiating another string in the output file. The number of 
output strings produced is clearly the maximum of the number occurring 
in an input file. Figure 6.3 illustrates the process for #7 = 3. The vertical 
strokes in the figure indicate the division into maximal strings and do not 
denote information represented directly within the files. 

In any sorting procedure, the smallest subprocess which treats the entire 
set of items once is called a phase. The smallest subprocess which by 
simple iteration produces the sorting process is called a stage. A stage 
may comprise one or more phases. In the merge sort described above, for 
example, the classification phase and the subsequent merge phase together 
constitute a stage which is iterated until order is achieved; the process 
is therefore called a two-phase merge. The use of a revision operation 
(Table 6.1) permits the classification and the merge to be coalesced into a 
single phase, and the resulting process is called a single-phase merge. The 
single-phase merge requires 7 input files and m output files, whereas the 
two-phase merge requires only (#7 + 1) files—-one input and # outputs 
in the classification phase, and #7 inputs and one output in the merge 
phase. 

The following format will be assumed for the original files in all programs 
in this chapter: the terminal item is a dummy (null) which is not to be 


* This method, commonly credited to Goldstine and von Neumann (1948), was 
presented by J. W. Mauchly in July 1946 in the Moore School lectures (1948), 
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sorted and which is accompanied by a terminal partition A,,,, where / is 
the index origin in use; all other items are separated by the partition A,. 
The final dummy item facilitates the use of the immediate branching 
convention introduced in Sec. 1.22. 


Two-phase merge. The(m + 1) files used will be labeled D1, D,',...,@,), 
and ,?, the last serving as initial input and as input during each classi- 
fication phase. Program 6.4 describes the entire merge. 


Program 6.4. The merge phase is shown on the left and the classification phase 
on the right. The heart of the former is the loop 7-17. Its operation is controlled 
by the logical vectors v and z (of dimension m), which specify the set of exhausted 
files and the set of ineligible items, respectively. An eligible item is one which has 
been read to one of the m item positions p,, is not a dummy, (i.e., null), and 
possesses a key which equals or exceeds r, the key of the item last recorded. 

The merge phase terminates on step 7, when all files are exhausted. Step 8 
initializes the vector z (ineligible item positions) to the value v (exhausted files), 
and step 9 increments the output string count s. Each execution of the subloop 
10-14 records on ®,? the item p; having the smallest key of all eligible items. 
Steps 10-12 select and record the item p; and preserve its key as the variable r. 
Step 13 reads the next item from file /. If this exhausts the file, the branch to 
step [5 is followed. Step 15 adds / to the set of exhausted files and step 16 adds it 
(since the new item p, is a final dummy) to the set of ineligible items. Step 17 
then repeats the subloop if any eligible items remain and otherwise re-enters the 
major loop at step 7. If the files are all exhausted, step 7 branches to step [8 to 
begin the classification. If not, the production of a new maximal string is begun 
on step 8. 

If step 13 does not exhaust the file, it is followed by the decision on step 14, 
which repeats the subloop if r does not exceed the new key, and otherwise adds / 
to the set of ineligible items on step 16. 

The necessary initialization is performed by steps 3-6. Step 3 rewinds all input 
and output files. Steps 4 and 5 perform the initial read from each unexhausted 
file to the corresponding item position and respecify v to indicate any file which 
may be exhausted by the initial read itself. The vector v is itself specified external 
to the process, so that the initial set of input files may be restricted at will. On 
subsequent repetitions of the merge, all files are made available by step 2. 

The classification phase begins by terminating the output file with a dummy 
item and final partition A, and rewinding it. The # input files are also rewound 
to serve as output files in the subsequent #-way classification, and the process is 
terminated by step 20 if the output string count is equal to one. Step 21 redefines s 
as the maximum number of strings to be allotted to the output files in turn, and 
step 22 reads the first item from the input file P,°. 

The output files are selected in ascending order by the index /. The variable /, 
which counts the strings, is initialized on step 25 and decremented on step 30 each 
time a break is indicated by the preceding comparison on step 29. When s strings 
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have been recorded, the branch from step 30 to step 24 is followed to repeat the 
process for the next output file. 

The classification phase is terminated by the occurrence of the partition A, on 
step 28. Step 1 then records a dummy and a final partition on each output file 
(including any which may have received no items), and step 2 resets the vector v 
to zero. 


The merge phase with which Program 6.4 begins is essential, even though 
the original data are contained in a single file, since it also serves to 
determine the string count s needed in the subsequent classification. The 
need for the string count can be avoided by using a classification process 
which merely assigns successive strings to successive files in cyclic sequence. 
This process does not satisfy the definition of a simple classification, and 
it will be given the name string classification. String classification is 
frequently more convenient to use than simple classification, particularly 
in processes such as the single-phase merge to be described next. Two 
successive strings assigned to a given file in string classification can coa- 
lesce to form a single string, but the probability of such an occurrence is 
small, especially in later stages. 


Single-phase merge. The two phases of the merge sort can be coalesced in 
a single revision operation employing m input and m output files. The 
two rows of files @! and ? serve alternately as input and output on 
successive stages. 


Program 6.5. The main subprocess (15-22) differs from the corresponding seg- 

ment (10-17) of Program 6.4 only in the control of file selection, the alternation 
of input and output being controlled by the alternator a, which alternates between 
1 and 2 (step 2) on successive stages. The classification is controlled by the variable 
i, which selects the particular output file on step 17 and which is itself cycled 
through the integers | to m by step 14. When all files are exhausted, the branch 
to step 2 is followed, resetting the vector v to zero, terminating the output files, 
and rewinding all files. The final output is contained in file ?,*. 
Elimination of file rewind. Each of the sorting processes described requires 
a rewind of all files between successive stages. If the files employed are 
capable of backward read, the processes can be modified so as to eliminate 
the need for rewind. Since each file is alternately read and recorded in 
successive stages, each will always be read in a fixed direction and recorded 
inthe other. Since the space needed for recording is not known in advance, 
it is necessary to do all recording in the forward direction and therefore 
to read in the backward direction. The changes required will be illustrated 
by modifying the single-phase merge of Program 6.5. 


Program 6.6. Since the alternate forward record and backward read effectively 
reverse the order of all files on successive stages, alternate stages must assemble 
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Program 6.5 m-way single phase merge sort 


ascending strings and descending strings, respectively. This is achieved by re- 
versing the algebraic sign of the key in statements 20 and 24 (by use of the alter- 
nator a) on even-numbered stages. 

Except when the entire process is terminated by failure to follow the branch 
from step 8 to step 11, the output files are never terminated by a partition, nor 
rewound, but are simply read backward as in statements 13 and 23. To ensure 
that the backward read of a file terminates properly, each is provided with an 
initial dummy item (step 11), and the branch on step 23 occurs on either the 
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terminal partition A, or the permanent initial partition 7. The entire process is 
terminated by equality of s and a — 1 on step 8, which can occur only if s = | 
and a = 2. This pair of conditions ensures not only that the number of strings is 
unity but also that an odd number of stages has been executed, and hence that the 
final output (on ®,”) is in ascending order. 

Since the initial input file must normally be read forward, an exception is made 
through the agency of the variable c. On the first pass only, c = 0, and the 
reads on steps 13 and 23 are therefore forward. The forward read on the first 
pass necessitates a subsequent rewind of the input files, which is provided by 
step 6. 


A commonly used variant of the m-way merge sort (called string- 
doubiing from its behavior in the case m = 2) treats the initial input as if 
the maximal strings contained were each of length one, and therefore 
produces strings of uniform length min the output. These uniform strings 
may not be maximal but are treated on the next stage as if they were, i.e., 
output strings of length m? are produced. In general, the kth stage pro- 
duces strings of uniform length mm", and [log,, n] stages are required to 
order n items. The number of stages does not depend on the initia! number 
of maximal strings, and no use is made of possible inherent order in the 
original array. The comparison operations may, however, be somewhat 
simplified, since the need to test eligibility is replaced by counts of the 
items read from each file or by recording partitions between successive 
strings. If # is not an integral power of m, some of the strings will be 
shorter than the normal length. They can be expanded by dummy items, 
although the use of partitions renders this unnecessary. 


Classification and simple merge 


The classification and simple merge sort is also referred to as a column 
or digital sort, for the successive classification phases are controlled by 
successive columns (digit positions) of a positional representation of the 
key. The behavior of the process is not so obvious as the behavior of the 
merge of the previous section, and a formal proof of its ability to produce 
ordering will be given at the end of this section. Since the process is based 
on a positional representation of the key, it will be convenient to use 
0-origin indexing for all operands. 

Let k be the (nonnegative) sorting key, let d be its digital representation 
in a base 6 number system (that is, (be) ' d =k), and let q = (d), 
where b% exceeds the largest existing key. The complete column sort 
comprises g stages, each stage consisting of a b-way classification 
followed by a b-way simple merge. The classification on stage / is based on 
d, ; (the jth digit of the key counting from the low-order end), and each 


item is assigned to file ®,, where i = d, ;. The simple merge is defined 
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such that the output file has the form ®), ,,..., ®,_,, where the @, are 
the 4 output files of the preceding classification. 


Two-phase column sort. Like the merge sort, the base 6 column sort may 
either be two-phase (classification followed by simple merge and using 
b + | files) or single-phase (using 26 files); the two methods are described 
by Programs 6.7 and 6.8, respectively. 
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Program 6.7. Steps 1-7 constitute the classification and steps 8-14 the merge 
phase. File ®,° is the single input to the classification and the single output of the 
simple merge. The remaining 6 files are denoted by ®;7, i € 16). 

The component of d which controls the current classification (step 6) is selected 
by 7, which scans d from right to left. In the (implicit) specification of d on step 5, 
b serves as an auxiliary variable (cf. Sec. 1.19). When the classification phase ends 
with the exhaustion of ®,° (step 4), the output files are terminated, all files are 
rewound, and the simple merge is performed (steps 10-13) to re-collect on P,° the 
results of the preceding classification. This file is then terminated by step 14, and 
the entire process is repeated from step 2 for the next lower value of 7. The 
dimension of d is specified (by compatibility with e€(g) on step 5) as gq, and q is, 
of course, specified externally. 
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Program 6.8. The alternator a again determines the roles of the two sets of files 
(input or output) on successive stages, but because of the use of 0-origin indexing, 
it alternates between the values 0 and 1. The classification process (steps 7-11) 
differs from that of Program 6.7 only in the control exercised by the alternator a, 
and in the specification of a sequence of input files Dy7, D,7,..., OF _, instead of 
the single file @y°. 
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Program 6.8 Single-phase column sort (output on 6 files) 


The program is deficient in two respects: it requires that the original input be 
on a full set of 6 files, and it produces a final output on 6 files rather than one. 
The first defect may be remedied by the use of an externally specified logical 
vector v to designate the input files which are unused in the original stage. The 
second may be remedied by a final simple merge following the gth stage. This 
could be added as a separate program, but it can be effected more simply by 
replacing g withg + | to ensure that the high order digit dy is identically zero and 
so restricts the final output to a single file. 


Validity of the column sort.* If d is the q-digit, base b representation of the 
key k of an item p, then the (g — /)th pass of the base b column sort assigns 
the item to file ®,*, where i = d,. The subsequent simple merge collects 
the files in the order ©,*, D,7,..., @,,. Consider any two items p, and p, 
with distinct keys k, and k, represented by d1 and d?®, respectively. It may 


* This proof is due to Ashenhurst (1953). 


§6.1 Serial sorting methods 189 


be assumed, without loss of generality, that k, < ky. Since the keys are 
distinct, there exists an integer r such that d,1 < d,? and that a’/d! = 
a’/d®, Thus r is the (0-origin) index of the highest order column in which 
the keys differ. 

In the (a — r)th pass, the items p, and p, are therefore assigned to files 
®,* and ®,* such that h < i. Hence at the conclusion of the subsequent 
merge, item p, precedes item p,. In each subsequent stage the two items 
are always assigned to the same file (since d,! = d,?, s <r), and the two 
items are therefore always relatable with respect to these operations. 
Since each stage is a copy operation, and therefore rank-preserving, the 
relative ordering of all relatable items is preserved, and p, thus precedes p, 
at the conclusion of the process. Thus for any pair of items p, and p, with 
keys k, and ky, such that k, < ky, item p, precedes py in the final arrange- 
ment. 


Vector keys and categories. The column sort is actually based on the 
components of the vector d and only indirectly on the numerical key it 
represents. The process can therefore be generalized to any numerical 
vector key y defined on each item a,. It can be further extended to an 
arbitrary key vector b belonging to the product set 


C= HCl Dee + cl 1 


as follows: on the classification stage based on component B,, the item is 
assigned to file ec’. b,. An obvious generalization of the foregoing proof 
of validity shows that the resulting ordering is that of the product set C, 

The component set ce’ is called the jth category, c° is called the major 
category, ce) ~! the minor category, and the ordering is said to be defined 
on category ce)! within ec? ~*... within c°. For example, a nine- 
column employee number b = (by, b,, by, bs) may be based on four 
categories, the first component b, representing the employee’s one-bit 
payroll classification (hourly or salaried), the second his two-decimal- 
digit department number, the third his two-alphabetic-character job code, 
and the fourth his four-decimal-digit identification number. The column 
sort on the base 6 representation of a numerical key is clearly a special case 
of a vector key in which each of the categories is the set 1h). 

It is frequently necessary to order a set of items on certain subsets of the 
given categories, and on different rankings of the categories. In general, if 
b is a vector key and m is any compatible mapping vector, then a related 
ordering may be defined on the vector d = b,,. If—continuing the previous 
example—it is required to produce a list ordered by employee's department 
within job classification, then m = (2, 1) and d = (by, b,). 

Any infix of b defines a related ordering which is actually achieved at 


190 Sorting §6.1 


some stage of the column sort on b, and the ordering defined by it is there- 
fore said to be contained in the ordering b. If the orderings d°, d',..., d"-! 
are all contained in 6, they can all be achieved at some stage in the ordering 
b, and the total number of passes required may be less than that required 
to achieve the r different orders independently. For example, d® = (by, by), 
d@' = (by, by, b,), and d® = (by, by, bo, bs, by) are all contained in f = 
(by, By, by, By, bz, by) and can be achieved jointly by sorting on the six 
components of f rather than on the ten components of d°, d!, and d?. 

The usefulness of this result is further extended [Ashenhurst (1953)] by 
the following fact: if a given component recurs in an ordering vector, its 
later occurrences may be ignored. For example, the second occurrence of 
by in the vector f (as fz) may be ignored, and f therefore contains the 
ordering d® = (bg, by, by, bs, by) as well. 

The propriety of suppressing later recurrences of a component of an 
ordering vector is easily established. Let 6 be a given ordering vector and 
let d= b,, and c = b,, be two related orderings. Moreover, let d be 
obtainable from c by suppressing all later recurrences of components of b. 
This implies that the mapping vector m is obtained from the mapping 
vector m by the same process, i.e., m = (a/n)/n. If p, and py are two 
items whose (distinct) keys c' and c? agree in all components up to but not 
including the /th, then their relative order is determined by the component 
c; = b,. In the corresponding keys d! and d? it is clear that the first 
component in which the items differ is again b,, and that the same relative 
order is therefore determined by d. Since a third ordering v = 5, is also 
equivalent to the ordering c = b,, if p is also reducible to m, the result 
concerning equivalent orderings can be extended as follows: two orderings 
y = 2,, and w = z, are equivalent if (o/m)/m = (a/n)/n. 


Choice of number base. In the merge process of Program 6.4, the value of 
m may be chosen to suit the number xv of files available, that is, m1 =n — L. 
In the column sort, on the other hand, the number of files required is 
determined directly by 5, the base of the number system representing the 
key. If the choice of 6 is otherwise arbitrary, it can be chosen as 7» — 1 for 
any 1 > 2. However, explicit execution of the base conversion indicated 
by the statement 
(be) | d<—k(p) 


is usually avoided by using the base in which the key is represented in the 
original files. If a base conversion is required, it can be performed once on 
the first stage and the resulting vector d incorporated with the item for use 
on subsequent stages. 

Base conversion may sometimes be inconvenient or impossible as, for 
example, in the case of special purpose sorting equipment devoid of 
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arithmetic units. In this event the stage required for each digit (b-way 
classification and subsequent merge) may be executed as a series of copy 
operations each utilizing fewer than (6 + 1) files. If, for instance, the 
digits are represented in a ranked binary 17-bit code, a series of 1 two-way 
classifications and two-way simple merges will effect the desired ordering 
on one digit of the key. More generally, a group of p successive binary 
digits can be treated at each pass, providing that 2? + 1 does not exceed 
the number of files available. 


Repeated block sort. If a set of items is classified on the high-order 
column of an associated vector key, each of the resulting subvectors can be 
separately ordered and then merged in a final simple merge. Thus each of 
the subvectors forms an infix or b/ock in the final arrangement. Such 
block sorting can be used to distribute the labor among a number of 
independent sorters. 

Block sorting can also be repeated by further classifying each block on 
the next lower-order column of the key. Fora key vector of dimension gq, 
g repetitions of the block sort yield a complete classification, and ordering 
can then be achieved by a subsequent simple merge. However, since the 
number of blocks produced is (except for duplicate keys) equal to the 
original number of items, the use of repeated block sorting is unattractive 
unless simplified by special schemes for keeping record of and controlling 
the numerous blocks. Two such schemes will be discussed. The radix 
exchange sort (Sec. 6.4) is appropriate to random-access storage only: the 
amphisbaenic sort is appropriate to serial files. 


Partial pass methods 


Each of the sorting schemes discussed thus far is constrained to treat the 
entire collection of items at each stage. Partial pass methods obtained by 
relaxing this requirement normally achieve a reduction in the total number 
of items handled, but at the cost of some increase in complexity. The 
partial pass methods gain their advantage by largely obviating explicit 
merge phases. 

The basic column sort gives rise to two partial pass methods of interest, 
the amphisbaenic sort and the partial pass column sort. The cascade sort 
arises from the use of partial passes in a merge sort. 


Partial pass column sort.* This method achieves the effect of one stage of 
a column sort on a base 4 key with fewer than (4 + 1) files by using a 
sequence of partial passes. The method will be illustrated by an example 
[taken from Ashenhurst (1953)] involving four files and a decimal key. 


* Presented by John Mauchly in the Moore School lectures (1948) and treated more 
fully by Ashenhurst (1953). 
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Table 6.9. This table describes the partial pass column sort for reordering on a 
single column of the decimal key (the jth). The parenthetical expression follow- 
ing each file ?, indicates that it contains all items whose /th key digit equals one 
of the enclosed integers. A second pair of parentheses indicates a second set of 
items in the file following, and grouped separately from, the first set. Thus the 
original input file My is described by @,(0, 1, 2, 3,..., 9) or by any permutation 
of the decimal digits enclosed in a single pair of parentheses. 


Output Files 
Su I Fil Pass 
e nput File “racti 
P P Remaining from previous steps Copied Fracllon 
1 Da: CO, t, 2, 3, 4, 5, 6, 7, 8, 9) M,: (0, 2, 4, 7) 1.0 
Dy: (1, 5, 6) 
Dy: (3,8, 9) 
2 D2 (0, 2.4, 7) Py: (1.5, 6) @, 7) 0.4 
D5: (3, 8,9) (4) 
dy! i «) 
3 Dy: CL, 5, 6)(2, 7) Mg: (3, 8, 94) (5) 0.5 
Dy: (0) (1)(2) 
Dy: (6)(7) 
4 Dy (3, 8, 9)C4(5) Dy: (OCD) (3)(4)S) 0.5 
+ (67) (8) 
D,: (9) 
$ Dy: C78) Dy: COICI23)(4)5) (678) 0.3 
My: (9) 
Ds: 
6 Py: (9) Dy: (OM 1)(2)03 6405 (697 18) (9) 0.! 
Da: 
”; : 
Total 2.8 


Table 6.9 The partial pass column sort 


The first step copies each item whose /th key digit is (0, 2, 4, 7) to file P,, items 
(1, 5, 6) to file ®,, and items (3, 8, 9) to file P;. After each step, only the previous 
input file and the next input file are rewound, in this case Py and ®,. In step two, 
items (2, 7) are copied to ,, and therefore fo/low the group (1, 5, 6), as indicated 
by the separate parentheses. Similarly, items (4) and (0) are copied to Py and 9, 
respectively. 

Step three is preceded by a rewind of ®, and ®,. Since items (1) and (2) occur 
in separate groups in the new input file P., they can be copied to @, in separate 
groups (1) (2) as indicated. Similar remarks apply to items (6) and (7) copied to 
®,. The three subsequent steps complete the required ordering, producing in the 
original input file Pg the ordered array (0) (1)...(9). The rightmost column 
shows that fraction of the original file (assuming a uniform distribution of the key 
digits) copied on each pass. The total at the bottom indicates that the expected 
execution time is equivalent to 2.8 full passes. 


The partial pass process of Table 6.9 is described more concisely by the 
0-origin matrix M of Table 6.10. Element M,* specifies the file to which 


§6.1 Serial sorting methods 193 


Digit 0] 1] 0 


Digit 1/2] | 0 


Digit 2|1)2/0 
Digit 3| 3 0 
He (Pat No SE 
Digit 4/1);3] | 0 
~ 4 


Digit 5/2| |3/0 


Digit 6/2} |1! [0 


Digit 7/1)2)1 0 


Digit 8 | 3 1/0 


Digit 9} 3 2/1 Jo 


Cs 


Input |O|)1)/2)3)142 


Table 6.10 Matrix M describing the partial pass column sort of Table 6.9 


items with key digit s are to be assigned in the rth step of the process, for 
r 096) and s €v%10). The eleventh and last row of M specifies the input 
files, ie., M,” is the index of the input file in the rth step. An algorithm 
based upon the matrix M is described by Program 6.11. 


Program 6.11. The subloop 14-18 performs the classification according to the 
element M,*, using file i (specified by M," on step 9) as input. When the file is 
exhausted, the branch to step 6 increments r and repeats the process unless 
r = (M), Equality indicates completion of the /th column and causes a branch 
to step 4 to decrement / and reset r. The comparison on / is deferred to line 13 
so as to follow the termination and rewind of the new input file. The branch on 
step 10 prevents the recording of a dummy item on the original input file. The 
previous input file is rewound by step 8. 


Program 6.11 includes the two-phase column sort as a special case, for 
if (b + 1) files Dy, ®,,..., ®, are available, the matrix shown on p. 194 
specifies a process essentially identical with that of Program 6.7. File ®, 
corresponds to file @,)°, the first column of M determines the b-way 
classification, and each of the 6 succeeding columns specifies the copying 
of one file in the 6-way simple merge onto file ©,,. 

The method of partial passes is frequently used in the sorting of alpha- 
betic data. Variants include the 13 pass-per-character method of sorting 
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0-origin indexing 


®, 


Files (¢ = 0 for initial data). 
As in Table 6.10. 

Positional representation of 
key (r(d) = q). 

Base of representation. 
Column stage = g — j. 
Partial pass stage. 


Current input file. 


Program 6.11 


Legend 


Method of partial passes for # files 
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punched cards* and the procedure described by McCracken et al. (1959), 
p. 312. 


Amphisbaenic sort. The amphisbaenic sort (Nagler, 1959) is a particular 
arrangement of the repeated block sort employing partial passes. For a 
base & key it requires 6 + | files and proceeds as a sequence of classi- 
fications, with a simple merge of the last 6 subblocks produced occurring 
immediately after each classification on the low-order column. Each 
classification allots digits to the available output files according to the rank 
of the file index, e.g., if file Dy is the input, the digits 0, 1,2,...,4 — lare 
allotted to files 1,2,3,..., 6, and if file ®, is the input, they are allotted 
to files 0, 1, 2,4,5,...,6. The files are recorded forward and (except for 
the initial input) are read backward without intervening rewind. The 
subblocks are designated by the key digits on which they have been 
classified, e.g., by 0,1, 2,...,5 — 1, 00,01, 02, etc. The block chosen 
for classification at each step is the one with the smallest designation 
among those not yet merged, the designations being ranked according to 
their values as decimal (b-ary) fractions. Thus block 213 precedes block 
214, which precedes block 22. 

Table 6.12 shows the steps of an amphisbaenic sort on a three-digit, 
base three key using files Dy, ®,, D,, and @,, with the initial and final data 
on file Dy. The input files are designated by asterisks. Thus file Dy is the 
input to step I, and blocks 0, I, 2, are classified to files @,, M,, and ®, as 
indicated. Step 2 classifies the smallest block (0) from file , to files Dy, @,, 
and ®5. Step 3 classifies blocks 000, 001, and 002 to files D,, ,, and ®,. 
The next step merges these blocks to file @,, and the following step begins 
classification of the next smallest block (01) on file ®,. It is clear that if 
the files are read backward, the next block to be classified is always 
immediately available. The general process for a q-digit, base b key is 
described by Program 6.13. 


Program 6.13. 0-origin indexing is used throughout. Each classification is con- 
trolled by the “‘current’”’ vector ¢ (of dimension +), whose components are the 
successive indices of the available output files. Thus c, is the index of the file to 
which digit & is classified (steps 16-19). The current vector ts determined by 
step 10 so as to omit the index / of the current input file. The selection of the 
block to be classified is determined by the vector A (of dimension g), the next 
block to be classified on digit ; being determined by the prefix @/'1/h. Because the 
classifications proceed for increasing values of j and the blocks just produced 
appear last on the files, the selection can be determined by the last component of 
the prefix alone, i.e., by A;. This is done on step 7, where the index / of the input 


* Described in IBM Form 22-3177-2 Sorter Manual, p. \2. 


Step ] 2 3 4 (merge) 5 6 (merge) 7 8 (merge) 9 10 [1 (merge) 
WD 
- 2 
®, * 00 * 000-002 | 000-002, 010] 000-012 | 00-012, 020 000-022 000-022, 10 | 000-022* 000-102 = 
Q, 0 * 000 * O11 * 021 * It It, 100 11* oe 
®, 1 1,01 1,01, 001 1, O1* 1* 1 1, 022 1* * 101 * 
®, 2 2, 02 2, 02, 002 2, 02* 2, 02, 012 2, 02* 2* 2 2,12 2, 12, 102 2, 12* 
h (0, 0, 0) (0, 0, 0) (0, 0, 0) (0, 1, 0) (0, 1, 0) (0, 2, 0) (0, 2, 0) (1, 0, 0) C1, 0, 0) (i, 0, 0) (1, 1, 0) 
i 0 I 0 I 2 1 3 0 2 0 2 
j 0 1 2 i 2 1 2 0 I 2 1 
s (0, 0, 0) (0, 1, 0) (0, 1, 0) (0, 1, 2) (0, 1, 3) (0, 2, 3) (0, 2, 0) 
c (1, 2, 3) (0, 2, 3) C, 2, 3) (0, 1, 3) (0, 1, 2) (0, 1, 3) (i, 2, 3) 
—_——1 _ 
Step 12 13 (merge) 14 15 (merge) 16 17 18 (merge) 19 20 (merge) 21 22 (merge) 
@, | 000-102, 110} 000-112 } 000-112, 120} 000-122 | 000-122, 20 | 000-122* 000-202 | 000-202, 210 000-212 | 000-212, 220] 000-222 
Q, * 121 * 25 21, 200 21* * 221 * 
®, 111 * 122 * 22 22, 201 22* 22, 211 22* * 
®, 2,12, 112 2, 12* 2* 2 * 202 * 212 * 222 * 
h (1, 1, 0) (I, 2, 0) (1, 2, 0) (2, 0, 0) (2, 0, Q) (2, 0, 0) (2, 1, 0) (2, 1, 0) (2, 2, 0) (2, 2, Q) (0, 0, 0) 
i I 2 3 0 3 0 3 I 3 2 
i 2 ! 2 0 1 2 1 2 I 2 - 1 
s (0, 2, 1) (0, 2, 3) (0, 3, 3) (0, 3, 0) (0, 3, 1) (0, 3, 2) 
c (0, 2, 3) (0, 1, 2) (0, 1, 2) (I, 2, 3) (0, 2, 3) (0, 1, 3) 


Table 6.12 Amphisbaenic sort on 3-digit base 3 key 
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Program 6.13 Amphisbaenic sort 
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to the preceding stage determines the vector €'/L°(h) of output indices, from which 
the new input index is selected as the Ajth component. The successive values of h, 
i, j, 8, and c listed in the example of Table 6.12 may be helpful in tracing the 
operation of the program. 

When classification on the last digit is completed, the last subblocks are merged 
(steps 25-30) onto the output file ,. The variable / must then be decremented 
and the component h, incremented modulo 4 (steps 31-32). When h; completes 
a cycle (becomes zero), the corresponding subblocks are exhausted and / must be 
decremented repeatedly until the corresponding h, does not become zero. The 
main process through increasing values of / is then repeated. 

Although fh, determines the input subblock for the classification on digit /, it 
determines the index of the input file only indirectly (step 7) through the vector 
of output files, itself determined by the input file / used in the classification on the 
preceding digit. When / is increasing, the value of / is simply the value from the 
preceding stage. However, when / is decremented (steps 31-32), it is necessary to 
determine the input / used in an earlier classification. A record of the value of / 
corresponding to cach classification / is therefore kept (step 9) in the vector s, and 
is used to redefine / on step 33. 

Each file to be used as output (except possibly file @,) is first closed with a 
partition (steps 12-15) to demark the beginning of each of the subblocks to be 
recorded. Since backward read is to be used, two dummy items are provided so 
that the branch on A, coincides with the read of a dummy item. An extra read 
then disposes of the extra dummy. Any partition recorded on the final output 
file Py before a fully classified subblock (when / = g — 1) would remain in the 
final output. Step 11 prevents this by initializing the index k to | iff =q — | and 
the index of the first output file (that is, cy) is zero. In a similar manner, step 25 
prevents the attempt to copy file , to itself during the merge phase (steps 25-30). 
Step 28 reads the extra dummy item recorded in the partitioning operation of 
steps 12-15. 

All files are read backward except the initial input in the first stage. This 
behavior is controlled by the logical variable f (steps 16 and 20-21). The branch 
on step 20 fails the first time only, allowing f to be respecified as | and rewinding 
file DP). On subsequent stages, the branch to step 23 causes the read of the extra 
dummy partition. 


Cascade sort.* The cascade sort is a partial pass merge sort, with each 
stage proceeding as follows. The strings are initially distributed (unequally) 
among m7 of the # + I available files; an m-way merge to the empty file is 
performed until some input is exhausted; an m— | way merge to the 
newly emptied file is then performed from the remaining inputs, and so on 
to a final two-way merge. The effectiveness of the process depends on a 
particular initial distribution of the input strings. 

Table 6.14 illustrates the process for 190 strings distributed among five 
of six available files as shown in the first row. Succeeding rows show the 


* Due to Betz and Carter (1959). See also Gilstad (1960) and Carter (1962). 
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Stage Distribution of strings 
0 15 29 41 50 55 
l 5 9 12 14 15 
2 1 2 3 4 5 
3 ed 
4 0 0 0 0 | 


Table 6.14 Cascade merge sort 


distribution of strings at the end of the succeeding stages.* The process 
requires but four passes, only the last of which is a complete pass (e.g., the 
last five strings need not be copied in stage 1). 

The power of a merge process may be defined as the (average) factor by 
which the number of strings decreases per pass, i.c., as V's, where s is the 
number of strings whose ordering can be completed in p passes. For the 
given example, the power is approximately 3.7. This surpasses the power 
of 3.0 attainable in a three-way single phase merge sort using the same 
number of files. 

Gilstad (1960) has proposed a variant of the cascade sort (called 
polyphase) in which every phase is an #-way merge, i.e.. each newly 
recorded output enters immediately as input in the following phase. Its 
power is slightly greater than that of the cascade sort. 


6.2 EVALUATION OF SERIAL SORTING METHODS 


Three major factors enter the evaluation of a serial sorting process: 
the amount of program storage required, the number of serial-access files 
used, and the execution time. The first two factors require little analysis, 
and attention will be limited primarily to the third. 

Because the execution time of a serjal sorting process is normally 
determined almost completely by the time required to transfer information 
to and from the serial files, the execution time is assumed to be directly 
proportional to the number of passes of the files required. Each phase 
corresponds to a pass or (as in partial pass methods) to some fraction of a 
pass, and the number of passes per stage is determined by summation over 
the component phases. 

The constant of proportionality relating actual execution time and 
number of passes depends on such factors as the average length of the 
items, the reading and recording rate of the serial files, and (in processes 


* The jth column of the table refers not to a specific file but to that file which 
ranks jth in number of strings. 
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requiring rewinding after each phase) the rewinding speed. Since these 
factors are specific to particular equipment and particular tasks, and since 
the nature of the dependence is obvious, the present analysis is limited to 
consideration of the number of passes. 

Consideration is also given to related orderings (cf. vector keys and 
categories), which may, in the use of column sorting, be achieved more 
efficiently jointly than separately. 


Simple classification and merge 


The number of stages required in the m-way merge sorting methods 
depends on sy, the number of maximal strings in the original file, and on 
m, For, ifs, is the number of (maximal) strings at the conclusion of the 
jth stage, then s,,, =[s, + m]. Since the eventual string length must be 
one, the number of stages required Is given by 


r= flog, Sol 
for which the approximation r == Jog,, So will be used. The number of 
passes is then 2r for a two-phase merge and r for a single-phase merge. If 
the “cost” of the process is assumed to be of the form 


c=(a+ mr = (a+ m) log,, Sy, 


for some constant a, then the optimum choice of m is obtained as the 
solution of the equation 
log, m= !1+a+>m. 


The two-phase and single-phase methods may be compared for a fixed 
number of files as follows. Let n = 2k be the number of files,* let s be the 
number of strings, and let ¢, and /, be the execution times for the single- 
phase and two-phase methods, respectively. Then 

ty 2 1og,,, 8 _ 2 logy) § 


h log,,, 5 log, s 
2log,k 2 logy k 2 log, k 
~ log, (2k — 1) logy (2k — 1) log, (2k — 1)’ 
This ratio increases monotonically from 1.26 at k = 2 to an asymptotic 


value of two, Since it exceeds unity, the single-phase process is to be 
preferred. 


Expected number of strings. Since the original file is frequently specified in 
terms of the number of items 7 rather than the number of maximal strings 


* An odd number # would prejudice the result against the single-phase method, since 
one file would necessarily be left idle. 
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5, itis desirable to determine the relation between # and the expected value 
of s for a random distribution of the keys. 

Let k be a vector of dimension 7 + 1 whose first n components are the 
successive keys of the file and whose last component is infinite, and let 
f=(k < lk), d=(k = Jk), and b= (k.- Jk). 

If, for example, 
k=7 93558214, 
then 
tkR= «~793558214 
f= 1010001100 
d= 0000100000 


b= OLOILOIOOIT. 


Each unit component of f marks the beginning of each maximal 
ascending string in the forward direction (left to right), each unit com- 
ponent of +b marks the beginning of each maximal string in the backward 
direction, and a unit component of d marks each key which duplicates its 
predecessor. Consequently, +/f, +/b, and +/d are the number of 
forward strings, backward strings, and duplicates, respectively. Since the 
relations <<, =, and > are exhaustive and mutually disjoint, the three 
logical vectors are exhaustive and mutually disjoint, 1.e..f vd v b=e. 
andfAd=fAb=dA b=0. 

Consequently 
(+/f) + (+/b) + (4/4) = (hk) = +1. 


Denoting the expected value of x by e(x), it follows that 
e(+/f) + e(+/b) + e(4+/d) =n + 1. 


Symmetry and the assumed random distribution together imply that 
e(+/f) = e(+/b). Consequently, the expected number of strings e(s) is 
given by 

e(s) = e(+/f) = (n + | — e(+/d)) + 2. 


If there are no duplicates, the expected number of strings is approxi- 
mately n/2, and the expected string length is therefore approximately two. 
Assuming a uniform distribution of keys in a range of g values, the proba- 
bility that d; = 1 is clearly I/g for all /¢(n — 1). Therefore e(+/d) = 
(n— 1) +g. 


Classification and simple merge 


The number of stages required in the column sort is equal to qg, the 
number of significant digits in the representation of the key. To facilitate 
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comparison between different number bases, it is convenient to use the 
variable g denoting the range of the key. Thus in any base 5, the number 
of significant digits g required in the key is given by g = [log, eg], or 
approximately by g = log, g. For any pair of bases 6, and 4,, the corre- 
sponding number of stages g, and g, are related as follows: 


q, _ log, g _ log, b, _ log,, by 
go log,, g — log, by log, by 


= log,, by. 


A more practical form for calculation is 


q1 login by 
dz 1okyy Dy 


The foregoing expressions are identical in form to those obtained for the 
number of stages required in the merge sort, but with the range g replacing 
the initial number of strings sy, and with 6 replacing m. Moreover, the 
number of files required depends on 6 1n the same way that the corre- 
sponding merge processes depend on m. This holds for both the two- 
phase column sort (6 + | files) and the single-phase column sort (26 files). 
The analysis concerning the optimal value of m therefore carries over 
directly to the choice of the base 4, the only additional consideration being 
the possible need for base conversions on the key. The comparison 
between two-phase and single-phase processes also applies directly to the 
column sort, with the conclusion that the single-phase method is superior. 

Of the methods discussed, the column sort is the only one which shows 
significant advantages in the joint treatment of two or more related 
orderings. If « = %, and y = 2, are two vector keys, and if w‘//p = a'/q, 
then the ordering defined by the key z,, for r = @'/p will, when applied 
to the set of items ordered on y, suffice to produce ordering on x. The 
total number of columns sorted to achieve the two orderings jointly ts then 
reduced by /. 

More generally, if w’/p agrees with a selected subvector of the prefix 
a//g and ifthe remaining elements of the prefix occur in @’/p (in any order), 
then sorting on the columns corresponding to w‘/p may again be elided. 
More precisely, if there exist integers / and /, and a logical vector u such 
that 

w'/p = u/(a’/q), 
and (w'/p) = (u/(a’/q)). 


then the ordering * can be achieved by applying the ordering w'/p to the 
ordering y, and the total number of columns sorted is reduced by #. It is 
assumed that neither p nor q contains any repeated components, for if 
they do, each of the later occurrences may be suppressed. 
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Since the ordering on « can be performed before rather than after the 
ordering on y, the roles of p and gq may also be reversed, and the case 
showing the larger reduction may be chosen. If three or more orderings 
are prescribed. the foregoing method may be applied to evaluate each of 
the possible sequences of ordering. 


Partial pass methods 


In the absence of a general method for designing a partial pass column 
sort, its efficiency will be indicated only by an analysis of the four-file 
decimal key example of Table 6.9. If g is the range of the key, the number 
of passes is given by p, = 2.8q = 2.8 logy g. This may be compared with 
the value p; = 2 log, g obtained for the straightforward base three column 
sort, which can also be performed with four files. The ratio 


. = (2.8 logy, g) + (2 log, g) = 1.4 logy) 3 = 0.668, 

3 

indicates the superior efficiency of the partial pass column sort for this case. 
The four files can also be used for a single-phase column sort in a base 
two number system, yielding the value p, = log, g for the number of 
passes. Hence p,/py = 0.843, and the method of partial passes is again the 
more efficient. 

In the amphisbaenic sort on n items with a g-digit base b key, 6 + | files 
are required, and the total number of items handled in the classification 
phases is ng. In the merge phases, however, each item is handled at most* 
once, and the total number of passes is therefore less than (g + 1). This 
may be compared with the 2¢ passes required in a two-phase column sort 
using the same number of files. Alternatively, conversion of the key to a 
base [(b + 1) + 2] representation could permit the use of a more efficient 
single-phase merge requiring approximately q log, . ,,. 5 passes. 

The disadvantages of the amphisbaenic sort reside in the more complex 
program required and in the need for frequent reversal of the direction of 
the files, 1.e., from forward record to backward read. The time lost in such 
reversal may be considerable for certain files. 

The power of the cascade sort is, as indicated by theexample of Table 6.14, 
somewhat greater than that of the corresponding merge sort. Its behavior 
is most conveniently analyzed [in the manner developed by Carter (1962)] 
in terms of the difference equation satisfied by the number of strings 
occurring in successive states. The formulation of these equations is 
indicated in Exercise 6.19. Carter provides asymptotic solutions for cases 
of practical interest. 


* If the final output file @, is among the set of output files in the classification on the 
low-order digit, the subblock assigned to it need not be recopied in the merge phase. 
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In addition to its greater complexity, the cascade sort suffers from the 
need for a particular initial distribution of the strings, and from the 
dispersal of the file rewinds (which can be performed concurrently by most 
computers) throughout the process. Moreover, in the event of a computer 
error, a rerun from the last correct input files is much more difficult to 
program than is a corresponding rerun for a straightforward merge. 


6.3 AIDS TO SERIAL SORTING PROCESSES 


Internal sorting normally enjoys a much higher basic execution rate than 
does serial sorting, but for large volume files the limited size of internal 
storage may make serial sorting necessary. The amount of serial sorting 
may, however, be reduced by some use of internal sorting. For example, a 
preliminary internal sort can produce maximal strings whose lengths are 
limited only by the size of the internal store and thus reduce the number of 
strings presented for subsequent serial merge sorting. 

The present section is devoted to methods of reducing serial sorting by 
the auxiliary use of internal sorting. For this discussion, the only knowl- 
edge assumed concerning the internal sorting process is its capacity to 
order a specified number of items. 

Two classes of processes arise, one for aiding merge sorting and one for 
aiding column sorting. The aid to merge sorting is the simpler, since it 
consists merely in assembling long strings by internal sorting before 
beginning the serial merge sort. A serial column sort, on the other hand, 
may be aided by a final internal sort performed after the column sort. 

If k = R(x) is a key vector associated with x, and if m is any positive 
integer, then a serial column sort performed on the key vector 


kl = |k + me| 


produces the vector x° @ x! @ +++ x’, where the vector +’ contains all 
items such that k;1 = /. If the infix vectors x’ are then copied in turn from 
the file, individually reordered on the key 


P= (me)|,k 


and recorded, the resulting file will be ordered on k, Table 6.15 shows an 
example for m = 4. 

If internal storage allows the internal ordering of as many as » items, 
then the reordering of the infixes x’ can be accomplished by an internal 
sort provided that m is so chosen that r(x’) =<" A for all /. If the original 
keys are all distinct, #7 may be chosen equal to a. 

If the sort on Rk! is performed as a base 4 serial column sort, the number 
of stages required is reduced by approximately log, m from the number 
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Original Order Ordered on k? Sets of Dupli- Final Order 
x k kl rR? x k k! Rk cates in k} x Rk Ri R 
x, 7 1 3 x, 2 0 2 | ? { x, 2 0 2 
% 9 2 | x, 3 0 3 | | x, 3 0 3 
x, 2 0 2 x, 7 1 3 x 4 1 0 
x, 11 2 3 x, 6 | 2 | x! | x, 6 | 2 
x, 6 | 2 x 4 | +O x, 7 1 3 
x 4 1 ~«0 x 9 2 1 | ; { x 9 2 | 
x, 3 0 3 mG Ly 23 “ || 11 23 


Table 6.15 Internal aid to column sort (#77 = 4) 


required for a corresponding sort on the original key. The subsequent 
internal sort on k? therefore serves as an aid to the serial column sort. 
The arithmetic operations indicated in the definition of keys k! and k? 
may be simplified if 7 is chosen as an integral power of the base 4 of the 
original key. For, if the vector d is the q-digit base ) representation of k,, 
and if nz = 6/, then o'/d, and w'/d are the base 4 representations of k;! and 
k,?, respectively. The keys are therefore obtained from & by extracting the 
specified columns, and the serial sorting is reduced by exactly ¢ stages. 


6.4 INTERNAL SORTING METHODS 


Since the range of practical sorting methods is clearly broadened by the 
use of random-access storage, internal sorting methods include all of the 
serial processes treated in Sec. 6.1. However, since the use of random- 
access storage introduces certain new problems in the execution of these 
processes, they will be reconsidered before proceeding to methods suited to 
random-access storage only. 

If the available random-access storage is divided into a number of areas 
or fields, these fields can be used in lieu of the serial files. The serial 
sorting methods then carry over unchanged except that the automatic 
self-indexing property of the serial files must be replaced by an explicitly 
programmed indexing of the corresponding fields. 

The efficacy of an internal sorting process depends not only on the speed 
of execution but also on the number # of items which can be ordered, using 
a given storage capacity ¢ (measured in number of items). Since v7 is nearly 
linear inc, this property is measured in terms of the storage ratior = ¢ +N. 

In using random-access storage, the effect of a simple merge or a simple 
classification can be achieved rather easily through address moditication, 
Hence there is little advantage in splitting either the merge or the column 
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sort into two separate phases, and attention will be limited to the single- 
phase processes. The two-phase processes can prove superior if efficient 
block transfer of data is available in the execution of the program, but 
their behavior should, in any event, be clear from the treatment of the 
analogous single-phase processes. 


Simple classification and merge 


An internal single-phase merge analogous to the serial single-phase 
merge of Program 6.5 can be based on the assignment of two matrices |X 
and 2X to correspond to the sets of input and output files. The rows "X’ of 
the matrices “X correspond to the files ®,* of the serial process, each item 
being represented by a single matrix element. Items are read sequentially 
from and to fields; for each input field “X’, an index r, indicates the 
element “X,/ to be read next; for each output field ® “X*, an index s, 
indicates the element next to be specified. The items tn cach field occupy 
the leading positions in the field, and the index of the first unoccupied 
element in the input field “X’ is indicated by the parameter ¢;. The process 
is described by Program 6.16. 


Program 6.16. As in Program 6.5, the vectors rv and x specify exhausted ficlds 
(files) and incligible fields (positions), respectively. The parameters a and ¢ are 
initially specified external to the process. At the beginning of each stage the 
vector v (exhausted fields) is specified (step 3) according to the unit components 
of t. At each stage except the first, t is respecified (step 2) by the final value of s 
from the previous stage. When only one string remains, each component of s, 
save the first, will remain at its initial value of unity, and this condition is used to 
terminate the process at step |. The vector Rk represents the keys of the current 
items in the #7 input fields and is initially specified (step 7) by the keys of the 
column vector “X, of initial items of the input fields) The remainder of the 
process is closely analogous to Program 6.5. 


If the total number of items to be sorted is #, then each of the 2 fields 
«“X’ must accommodate # items, and the total storage allocated must be 
2nm, The storage ratio is therefore 2m. It can be reduced to two by 
putting the output in a single field and keeping a record of the beginning 
location of each successive set of s’ = [s + m] strings, where s is the 
maximum number of strings in any one input field. 

Let the input be represented by the single field X%, let b be a vector of 
dimension m + 1, whose Ath component specifies the beginning location 
of the Ath set of maximal strings, fork = 1, 2,..../, the /th set numbering 
possibly less than s’, and let 6,,, be the location of the first unused position 
of the field. Obviously / < m, and if j< m, the remaining undefined 
components of b are immaterial. If the / sets are to serve as inputs to an 
m-way merge, then the prefix #”/b serves to initialize the vector r of 
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Program 6.16 Internal single-phase #-way merge using 2/7 fields 


current input indices, and the suffix w/b defines the terminating 
locations f¢. 

The strings produced by the merge from the sets of maximal strings in 
X* can be transferred without classification to the single output field 
X?-“" If the vector b is redefined by the beginning locations in the output 
field of successive sets of [s’ + ml] items, it can be used to define the input 
fields in a subsequent merge from the field X* “, Program 6.17 shows a 
convenient arrangement of the process. 
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Program 6.17. As in Program 6.16, the vector r determines the current set of 
items being examined, and ¢ determines the terminal value of r. The first » 
components of 6 initialize r (step 7), and the last #: components initialize ¢ 
(step 8). The index wu determines the component of the output vector X3°@ which 
is next to be specified (step 24). 

The remainder of the process differs from Program 6.16 primarily in the deter- 
mination of b, The counter # allows / and b; to be respecified (steps 19-20) at 
the end of each group of s strings. The last specified component of 6 is deter- 
mined separately by step 5. Step 6 redetermines v for the next stage. 


Since some of the initial values sj, v°, and b° may be unknown, the 
initialization of the process (steps 1-3) merits some attention. If sy) alone ts 
known, v° and 6° may be chosen as follows: vo = @!; b,° = 1, b,° = r(X). 
The effect is to perform the first merge from a single input area. Con- 
sequently, the first stage performs no rearrangement but does determine 
the vectors v and B. 

If s, is unknown, s may be determined by a preliminary string count. 
Alternatively, it may be set to any integer value y = sy. The process 
remains valid, but the required number of stages is increased by |log,, y] — 
[log,, So). 

Since sy cannot exceed the number of items, the initial value s = »(X) is 
always valid, and for an assumed random distribution it exceeds the 
expected value by a factor of two only. If greater initial order is expected, 
it may be desirable to modify Program 6.17 to allow a small initial choice 
of s, accompanied by automatic respecification in the event that it proves 
too small. The modification may be simply made by inserting a branch on 
equality of and m, following step 18. The branch would succeed only in 
the event that the initial specification of s were too small and should lead 
to a process for increasing (e.g., doubling) s and repeating the process. 

The case m = 2 is of especial interest, partly because it leads to signi- 
ficant simplification in the storage allocation, and partly because the 
advantages which large values of m enjoy in serial sorting largely disappear 
in internal sorting. These advantages arise from the reduction in the 
number of stages with increasing m, with a nearly constant time per stage 
due to the fixed reading and recording time for the serial files. In internal 
sorting, the time required for key comparisons and address calculations in 
choosing the minimum key becomes relatively more important and, since 
the key comparison time is an increasing function (frequently linear) of m, 
the advantage may lie with small values of mm. 

The simplification of storage allocation arises in the following way. A 
two-way string classification on n items may be used in conjunction with a 
single output field with a total capacity of n items by assigning the odd- 
numbered strings from the beginning of the field in normal order and the 
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Program 6.18 Two-way internal single-phase merge 


even-numbered strings from the end of the field in reverse order. Thus if 
s’ is the jth string of 2k strings so classified, the output field would contain 
the array 


F@OPFQOHSQ™ DS I1GSS* OS: OSOSOS, 
> < 


where the arrows indicate the increasing directions of the associated 
strings. The restriction to an even number of strings in the foregoing 
example is clearly not essential. The corresponding two-way internal 
single-phase merge is described by Program 6.18. 


Program 6.18. Since the current index vectors r (for input) and s (for output) 
may always be initialized as shown in steps 3 and 4, and since termination of a 
phase occurs when r, exceeds r, (step 7), explicit use of the vectors 6 and ¢ of 
Program 6.17 is obviated. The only added complication lies in the different 
treatment of indices r; and s,, which must be incremented whenever used, and of 
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indices r, and s,, which must be decremented. This treatment is effected by 
addition (steps 16-17) of the rows Q/ and Q/ of the matrix 


‘tO 
Q= 
Oo -!I 
It is interesting to note that the use of indexed variables to allow comparisons 


with the larger (or smaller) of two keys (steps 9 and 11) reduces the requisite 
number of comparisons from four to three. 


The method of string-doubling also permits some simplification in 
storage allocation and address calculations. 


Classification and simple merge 


As in the case of the internal merge sort, the internal column sort differs 
from the corresponding serial sort primarily in the problem of storage 
allocation. Again, the straightforward solution lies in the allocation of 24 
fields of item positions each, and the use of b-dimensional indexing 
vectors r, s, and ¢ to control the input and output fields. The sorting 
process used is identical with that of Program 6.8 (serial single-phase 
column sort), and the indexing problems are analogous to those of Pro- 
gram 6.16 (internal single-phase merge using 2m fields). 

As in the corresponding merge sort using 2m fields, the foregoing process 
has a high storage ratio which can be reduced to two by a two-field process. 
Unlike the corresponding case for the internal merge of Program 6.17, 
the explicit classification process cannot be avoided. Consequently, it is 
necessary to determine in advance the size of field required for each of the 
b classes corresponding to digits 0, 1,...,(/—1). This leads to the 
so-called pre-count column sort of Program 6.19, in which each stage 
incorporates an examination of the next higher order position of the key 
and a count of each of the digits occurring. 


Program 6.19. 0-origin indexing is used, and the vectors X“ and X“ (fora = 0 
or |) serve as input and output fields. The classification on the key digit d; is 
performed so that all items which agree in the /th column of the key form an intix 
in the output X“, and so that the value of d; associated with successive infixes 
increases monotonically. The output indexing is determined by the vector s, 
which is, in turn, initialized by the vector 6. The value of 8 for the succeeding 
stage is determined by steps 13 and 14*, according to the value of the next higher 
key digit d; ,. The initial value of 6 is assumed to be defined externally, It must 


* Statement 14 is, for most computers, an ineflicient procedure for determining b. 
Normally it is preferable to make a simple count of each of the digits and to sum the 
counts to determine s at the beginning of the next stage. 
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Program 6.19 Pre-count column sort 


be determined by a preliminary count, perhaps performed when reading the 
items into the initial field. The use of g + 1 instead of g in steps | and 9 ensures 
that the pre-count quantity d; , is properly defined even for the final stage. 


Special internal sorting methods 


The present section is devoted to internal sorting methods which are 
specifically unsuited to serial files. The storage ratio provides a major 
categorization of methods; a method either possesses unity storage ratio 
or it does not. 

Unity storage ratio is achieved by methods which proceed by the inter- 
change of item pairs. The type of interchange may be limited to the 
transposition of adjacent items, to “insertion” of an item accompanied by 
a movement of all intervening items toward the evacuated position, or to 
the exchange of an arbitrary pair. The corresponding methods are 
characterized as transposition, insertion, and exchange methods, respec- 
tively. 

Exchange methods include the radix exchange sort. This is an arrange- 
ment of the repeated block sort for a base two key, for which the operation 
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count (number of elementary operations required to order # items) ts of the 
order of n logy a. 

Transposition methods include the bubble sort, odd-even transposition, 
and the ranking sort. They are characterized by relatively simple programs, 
an operation count of the order of #? for random initial order, and the 
capacity to utilize existing order to reduce the operation count. 

The only insertion method treated ts ranking by insertion. Thé operation 
count (counting comparisons only and not counting the individual item 
transfers of the block movements associated with each insertion) is of the 
order of ” log, n for random order, and is reduced by existing order. It is 
most attractive in a computer providing efficient block movement of 
items. 

Methods having a storage ratio greater than unity include the merge and 
column sorts previously discussed. One additional method of this type is 
treated—the pth-degree repeated selection sort. The operation count is of 
the order pn’, where g = (p + 1) + p. 

Any internal sorting method can be broken into two distinct phases, the 
first utilizing only the keys to determine the permutation required on 
the items, and the second effecting the permutation of the items. Since the 
permutation vector is, in effect, a table of addresses of the items, the 
process is called an address table sort. Address table sorting ts particularly 
advantageous if the volume of data in the item is large compared to the 
data in its key. 

Any sorting method in which each stage isolates the item with the 
smallest key (among the items remaining from previous stages) can be 
modified to produce longer strings by the use of one auxiliary serial input 
file and one auxiliary serial output file. The modification is called sorting 
with replacement. It consists in recording the selected minimum item in the 
output file and reading from the input file a replacement item which enters 
in the subsequent stages only if it is eligible for continuation of the string 
already recorded. 

The internal methods are evaluated and compared in Sec. 6.5, and the 
results are summarized in Table 6.37. 


Radix exchange. Radix exchange is a form of the repeated block sort fora 
base two key. The high-order column of the key is scanned first. The first 
zero item (item with key digit zero) is exchanged with the last unit item, the 
second zero item is exchanged with the second last unit item, and so on, 
until the first stage of the block sort is completed. The zero items now 
form a prefix in the vector of items, and the unit items form a suffix. The 
process is then repeated on the next column of the key, first on the prefix 
obtained from the first stage, and then independently on the suffix. Each 
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column of the key is treated in turn, the exchange process being performed 
independently on each of the disjoint infix vectors defined by the preceding 
stage. O-origin indexing will be used throughout the discussion. 

Primary contro! of the process is exercised by the vector b, whose 
successive components specify the beginning index of successive subblocks 
(infixes). In particular, by = 0, and 6, , = »(*), and at the beginning of 
the Ath stage, »(b) = 2"! + 1. The storage required for 6 is therefore 
significant but may be reduced by either of two expedients. The vector b 
may be replaced by a logical vector u such that u/i° = b. Determination 
of the next component of b then requires a scan of the components of w. 
The use of wis thus normally unattractive except in a computer equipped 
for the efficient determination of the location of the leading nonzero 
component of a logical vector. The second expedient leads to a more 
complex but more practicable process. The straightforward use of b will 
be treated first. 


Program 6.20. Steps 10-21 perform the exchange on the subblock with indices 
k in the interval 6; « & «<< b;.,. The indices ry and r, designate the pair last 
exchanged, & is the index of the current item examined, / is the current column 
of the key, and a is an alternator which is zero during the forward scan of the zero 
section and unity during the backward scan of the unit section. The alternator a 
determines which of the indices ry and r, will initialize & (step 13), the direction 
of the scan (step 14), the type of key digit (0 or 1) sought in the next item to be 
exchanged (step 18), and which of the indices rg and 1, is to be redefined by k 
(step 19) when the search terminates through failure of the branch at step 18. 
If a does not become zero from negation on step 20, the process is repeated from 
step 13 with a = 1, producing a backward search for a unit digit in the key. Ifa 
becomes zero, both the forward and backward scans have been completed, and 
the required item exchange is performed on step 21. The final exit from the entire 
loop 13-21 occurs when & = r;, that is, when & = rg on a backward scan or 
k =r, on a forward scan. In either event, the final value of & is the beginning 
index of the new subblock defined by the exchange, and it is used immediately to 
specify c; on step 16. The vector ¢ is eventually meshed with 6 (step 6) to re- 
specify b as (Bo, Cy, by, Cy, ...). The initial specification of 6 on step | ensures 
that the first subblock treated is the entire vector «x. 


The number of subblocks which must be distinguished at any one time 
can be reduced to the dimension of the key by a method due to Hilde- 
brandt and Isbitz (1959). The process ts controlled by a partition vector p 
of dimension g + 1, whose successive nonzero components specify the 
beginning indices of the subblocks of x remaining to be ordered. At each 
stage, the first remaining subblock is exchanged on the appropriate key 
digit d,, 1.e., for j increased by one over the value used in generating the 
particular subblock. When the exchange occurs for / = q — 1, the order- 
ing of the two leading subblocks is complete, and they are removed from 
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Program 6.20 Radix exchange with r(b) ~~ 2d) 


further consideration by respecifying p, by p,, and resetting p, and p, to 
zero, where p, and p, are the first nonzero components of a@'/p. The end 
of the new leading subblock is now determined by p,, the new leading 
nonzero component of @!/p, and the exchange is executed on the appro- 
priate column /. 

Record of the value of j appropriate to a given subblock is kept by 
recording its terminal partition as p, ;. This is achieved, first, by recording 
each new partition generated by exchange on column / — | in component 
p, ; and, second, by advancing the component p,. (determined in the 
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Program 6.21 Radix exchange with r(p) = +(d) -- | 


prefix removal phase) to p,_, and resetting p,.to zero. Incidentally, this 
procedure ensures that p,, and p, always occur as p, and p,. In practice, 
these two components need not be reset if the scan for p,,. is begun with ps. 


Program 6.21. The reader may find it helpful to trace the program (i.e., record 
successive values of all parameters in tabular form) for a simple case of a few keys 
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of small dimension, The exchange phase (15--22) is identical with that of Program 
6.20 exccpt that the specification of ¢; is omitted. If fj » g — 1 at the conclusion 
of the exchange, / is incremented (step 10), ry is reset to Py — 1, and the new 
partition & specifies both p,_;andr,. If j = g — 1, the prefix removal is executed 
by steps 4-8. Step 4 respecifies py, and the scan of steps 5-6 (which begins with 
P) locates a nonzero component p,,_; which is advanced to p,_(;, 1, (indirectly by 
steps 7, 10, and 12) and is reset to zero by step 8. Steps O~I provide the initial 
specification of py and (via step 12) of py. 


Bubble sort. The basic operation of the bubble sort is the comparison and 
possible transposition of a pair of adjacent items so as to place the smaller 
of the two keys earlier in the sequence. The first stage consists of such 
operations performed in sequence on the item pairs (¥, ), ¥,), (% 2%. 1). 


Original Original Stage Stage Stage Stage Final 
Items Keys 1 2 3 4 Items 

x, 8 3 3 3 3 Xs 

xy 6 8 4 be FAN | 4 XG 

xy 11 6 8 6 6 *, 

X4 9 1 6 8 8 xy 

Xs, 3 9 11 9 9 xy 

X65 4 4 9 1! 11 Xy 


Table 6.22 Bubble sort example 


. 2.5 (¥), ¥). The result is to bubble each item upward tn the sequence 
until it encounters an item with a smaller (or equal) key and then to leave 
it and continue bubbling the new smaller item. In particular, the smallest 
item is bubbled to the top. Successive stages repeat the process, but since 
the jth stage brings the /th smailest item to the /th position, the (j + 1)th 
stage need not treat the first 7 positions. It is clear that +(x) — | stages 
suffice, but it is advantageous to allow termination at the end of the first 
stage during which no transpositions occur. 

Table 6.22 shows the arrangements prevailing at the end of each stage 
of a bubble sort. The items above the staircase line are not re-examined. 
Although the items are in correct order at the end of stage three, there is 
no available indication of the fact until stage four is executed without the 
occurrence of a transposition. 


Program 6.23. The detailed behavior of the bubble sort process described by 
this program should be elear from the foregoing discussion. It may, however, be 
remarked that at most r = (x) — | stages are executed, even though the final 
order is achieved only at the rth stage. 
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Program 6.23 Bubble sort 


If a given set of ttems is completely ordered except for a single item 
which is displaced upward from its proper position by / places, / stages of 
the bubble sort will be required to complete the ordering. On the other 
hand, a single stage of the bubble sort performed in the alternate direction 
{i.e., scanning from 4, to x, and bubbling the large items downward) 
would suffice. In general, there is some advantage in performing successive 
stages of the bubble sort in alternate directions. 

If on a backward scan (from *, to *,) no transposition occurs between 
items *, and #,,, then *,; and «#,., are in correct relative order. 
Consequently, if *, and (the possibly new) «,_, are not transposed on the 
succeeding forward scan, then #, and #,,, will suffer no transposition. This 
result may be extended to strings of ttems which suffer no transpositions, 
and a record of this existing order can be used to obviate the corresponding 
comparisons. 

More precisely, if s is a logical vector such that s,; = | if and only if no 
transposition occurred between items *; and *,_, ina backward scan, then 
no transposition between items *,_, and #, on the succeeding forward scan 
will also imply no transposition (i.e., no need for comparison) between x, 


and «,., if s; = 1. More generally, if a@ = +1] js an alternator such that 
a = | on the forward scan, then s, may be defined as unity if and only if 
no transposition occurs between x, and x,.,. Program 6.24 shows the 


entire process. This variant of the bubble sort is attractive only for 
computers in which the indicated scan of the logical vector can be per- 
formed efficiently. 
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Program 6.24. Multiplication of each key by the alternator @ provides the 
required alternation in the behavior of the branch on step 10. The behavior is 
best appreciated by tracing a simple case. 


< 
1 — Atm) sk) o 
1-origin indexing 
2 wy —_ No 
3 5 «= al(v(x)) V wiv(x)) x | Vector to be ordered. 
4 (a,j) -—(-1, ) $ | String indicator 
: st G(s) = r(x). 
1 € ; 
° J | Scan index. 
J eee a | Alternator (+1). 
7 pf e he 
= id 
8 fit@#4#it+ (a = 1)(*) Legen 
9 SA Sie tl | = | 
10 53 << (ak(x;) < ak(%),q)) Le 
11 N5<> X jag 


Program 6.24 Forward and backward bubble sort with string indicator 


Odd-eren transposition sort. Like the bubble sort, the transposition sort 
has unity storage ratio and involves the comparison and possible trans- 
position of adjacent items. Each stage consists of two phases or half-stages. 
In the first half-stage, each item with an odd index (except the last if it is 
odd) is compared with its successor; in the second, each item with an odd 
index (except the first) is compared with its predecessor. Table 6.25 
provides an example. 


Original Original Stage Stage Stage Stage Final 
Items Keys 1 2 3 4 Items 
xy 8 6; 6/ 6) 6; 3] 3] 3 Xs 
x, 6 8| 8} 8] 3] 6] 41 4 X¢ 
xy 11 9 9 3 8 4 6 6 xy 
x4 9 11 3 9 4 8 8 8 x, 
x; 3 3} 11 4} 9} 9] 9} 9 xy 
X¢ 4 Ae ae eae | cer Ce lel x, | 


Table 6.25 Odd-even transposition sort example 
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Program 6.26. The subloop 6-10 performs the first or the second half-stage of 
the transposition sort according as the alternator @ is 0 or |. Final termination 
occurs on step 3 as soon as one half-stage is completed without a transposition 
occurring, except that a minimum of two half-stages must be executed. The 
minimum of two half-stages is assured by the initialization of r on step I. 


1 —| 4<2 
7 

2 a~<1 1-origin indexing 

3 t<f-1 bake 

: x | Vector to be ordered. 
4 a<a eee 
t Termination indicator. 

5 j<ca@ a | Alternator (0, 1). 

6 Jojt2 <— j | Scan index. 

sale ji: ve) 

8 K(6;4) + kw) LE Legend 

9 Hj; 
10 t<1 a 


Program 6.26 Odd-even transposition sort 


The validity of the termination conditions of Program 6.26 follows from 
the following proposition: if any half-stage except the first effects no 
transpositions, the items are completely ordered. If the half-stage is odd, 
then 


K(¥,) <= k(%_)3 K(%3) < k(Hy)s 


Since no transpositions occur, the conditions of the previous stage (which 
exists by hypothesis) also remain valid, 1.e., 


K(%) < k("3); A(¥y) << A(H5)5 


The two sets of inequalities together imply ordering.* A similar argument 
applies for the case of an even half-stage. 

In Sec. 6.5 the transposition sort is shown to be less efficient than the 
bubble sort. However, it enjoys the unique advantage that all comparisons 


* This result may be combined with the fact (established in Sec. 6.5) that the number 
of transpositions required is finite to establish convergence of the transposition sort. 
For, if the set is not ordered, each half-stage must effect at least one transposition. 
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and transpositions ina given half-stage are independent and may therefore 
be executed in parallel. 


Ranking sort. If one new item is added to a vector of items already ordered 
on a given key, the resulting vector can be ordered by ranking the new 
item, i.e., by comparing it with the items of the original vector in succession 
until the appropriate rank of the added item is determined. Moreover, 1 
repetitions of this process, which draw the new items from a given vector 
of # items, will order the entire vector. Table 6.27 shows an example 11 


Original | Original} Initial | Stage | Stage | Stage | Stage | Stage | Final 
2 


Items Keys Set 1 2 3 ; 4 5 Items 
x, 8 8 6 6 6 3 3 x; 
My 6 8 8 8 | 6 xy 
Xy 11 Il 9 8 6 x. 
xy 9 11 9 8 x, 
X5 3 Hj 9 xy 
X% 4 I 3 


Table 6.27 Ranking sort example 


which the individual ranking operations are each performed by comparing 
the added item with the ranked items in turn (starting with the largest). 
moving forward by one place each item whose key exceeds that of the 
added item. 


Program 6.28. The index / controls the selection of successive items to define 
the item z (step 4) which is to be added to the ranked set by steps 6-9. The index / 
controls the selection of successive items of the ranked set for comparison with =, 
and each execution of the subloop terminates if either 7 becomes zero or if 
kK(z) ~ k(x;). In either event, step 9 inserts the new item = into the position x; ,, 
which was last evacuated. From the initialization of the index / it is clear that the 
process is actually completed in v(x) — 1 rather than (x) stages. 


Ranking by insertion, Since each stage of the ranking sort ranks one new 
item in an already ranked set, the determination of its position can be 
accomplished by a binary search. This sharply reduces the required 
number of comparisons. However, once the new position is determined, 
each of the succeeding items in the ranked set must be moved down one 
place to make way for the new item to be inserted. This method is par- 
ticularly good where (due either to suitable facilities in the computer or to 
the use of a chained representation for the vector of items ¥) such block 
transfer is easy to perform. 
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Program 6.28 Ranking sort 
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Program 6.29 Ranking by insertion 
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Program 6.29, The binary search is performed by the loop 11-15, which so 
determines / that the new item x; is to be inserted after x;. Since the floor opera- 
tion is used on step 12, / will eventually reach the value p, and step [4 therefore 
terminates the loop in a finite number of steps. Step 3 performs the required 
insertion by a right rotation of the infix of x bounded by x; and x;. The cases 
where x, lies outside the previously ranked set are treated by the comparisons on 
steps 7 and 9. Incidentally, step 7 takes full advantage of any initial order in the 
items, e.g., if the set is initially ordered, steps 8-15 are never executed. 


Repeated selection sort. The process of scanning all items of a vector for 
the smallest key and transferring the selected item to a separate output 
area (in a serial or random access file) will be called selection. Repeated 
selection on the successive remaining (unselected) items will serve to 
transfer the items in the order determined by the specified key. This 
method of ordering is called a (first-degree) selection sort. 

If the given vector of ” items is divided into m subvectors of at most 
[7 = ml] items each, then a selection from each subvector will produce a 
vector of m items, the /th of which is the smallest of the jth subvector. A 
selection performed on the m items of the resulting vector will then select 
the smallest item of the entire original set. If the selected item came 
from the Ath subvector, it is then replaced by a further selection on the 
remaining items of the Ath subvector. Repetition of the process # times 
serves to order the items. Because selection is performed on two levels, 
the process is called second-degree selection. 

In general, the smallest (first-level) item may be selected from a set of v, 
second-level items, each of which is selected as the smallest of v, (hird-level 
items. The process can clearly be extended to any desired number of levels. 
If p levels of selection are used, the process is termed pth-degree selection or 
repeated selection. It may be represented as a singular homogeneous tree 
T of height p + 1, as illustrated by Fig. 6.30. 

Figure 6.30 shows the initial filling of the lower levels in a third-degree 
selection sort performed on the sixteen items at the top of the tree T, with 
v(T) = (1, 2, 2, 4). The keys are indicated in parentheses. The positions 
of the third level are the nodes (1, 1, 1), (1, 1, 2), (1, 2. 1), and (1, 2, 2). 
They are first filled by items ¥,(6), %4(2), ¥(1), and #,,(4), respectively, each 
selected as the smallest among the second-level nodes of the corresponding 
subtrees Ty, 14). Tia.2 Tao, and Ty... respectively. The first position 
of level two is then filled by x,(2), selected as the smallest among the second- 
level nodes in its subtree, and so forth. Figure 6.31 shows the continuation 
of the process through the selection of the first two output items. 

If e is some value which exceeds the absolute value of all keys, then the 
selection process may be made more uniform by assuming either that the 
entire top level of the tree is occupied by the items to be sorted or that 
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Level Level Level Level 
1 2 3 4 
x1(9) 
x9(14) 


x3 (6) 
. wg (6) 
x4(16) 
x6(2) 
¥543)- 
x62} 
xe (2), x5 (3) =?) 
xg (13) 
x9efh 
Xyo(11) 
x9), x12(5) xn ®) 
1245) 
x9(1) 
: x49(15) 
P x44) 
x14(4) vis(12) 
x16 (10) 


Figure 6.30 The tree T representing the third-degree selection sort for v(T) = 
(I, 2, 2, 4) 
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x9(14) 
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x4 (16) 


x6(2), x5(3) < 
X5(3) 
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x7 
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11 
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x9(1), x14(4) x12(5) 


*13(15) 
x14(4) 

x15(12) 
x16(10) 


*14(4), X16(10) 


Figure 6.31 Continuation of the third-degree selection sort 
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unoccupied positions are filled with items having the key value e, and by 
replacing each item selected from the top level by an item with a similar 
key value. Termination of the process may then be determined by the 
appearance of such a key value at the output. Moreover, the initial filling 
may be simplified by filling all lower levels with dummy items having key 
values of (—e). These may be recognized and discarded at the output. 
The normal process will remove all the dummy items first, leaving all 
levels in the state which would be produced by a special initial fill. 


Program 6.32. The top level of the tree T initially contains the items to be 
sorted (completed if necessary by dummy items having the key value of e), and 
the remaining levels contain items with the key value (—e). The index vector s 
scans the second-level nodes of the subtree T51,, (steps 14-18) to determine the 
index m of the node having the minimum key z. If z # e, step 10 replaces the 
root of the subtree by the selected node value and step || respecifies s to begin 
the scan of the subtree rooted in m. If z = e, then all second-level nodes contain 
dummy items with “infinite keys,’’ and step 9 branches to step | to replace the 
root of the subtree by a dummy item as well. The branch from step !2 occurs 
when the scan of the top level has been completed; it also results in the insertion 
of a dummy item. 

Since each complete scan (over all levels) begins with s = (1, 1) (steps 3, 11, 15), 
the resulting minimum item is brought to the root of the tree. Step 2 specifies = 
as its key, and steps 4 and 7 determine its disposal. If z = e, all legitimate items 
have been flushed from the tree, and the branch to step 5 terminates the output 
file, rewinds it, and ends the process. If z = —e, the item is a dummy initial fill 
and is discarded by skipping the recording of the output file on step 8. 


Since the selection process proceeds by levels in the tree, a corresponding 
computer program can best be based on a right list—specifically, on the 
right list node vector €?/JT and the dispersion vector v(T). The computa- 
tion of the list index r(s) required in the path tracing is described (for 
0-origin indexing) by the recursion on the functions f and g developed in 
Sec. 1.23. This recursion yields a relatively simple computer program for 
a general homogeneous tree. It will be shown, however, that a b-way 
rooted tree (i.e., a rooted tree with a common branching ratio 5) is the case 
of greatest practical interest, and in this case the simpler recursion 


r(s) = 6 x r(w'/s) + 1 +5, 


(also developed in Sec. 1.23) can be used. Program 6.33 shows the repeated 
selection sort of Program 6.32 executed on the right node list vector r of a 
b-way rooted tree. 


Program 6.33. The initial conditions are as assumed for Program 6.32, and the 
steps of the two programs correspond very closely. The simple index modifica- 
tion required from stage to stage is shown in step II. 
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Program 6.32 Repeated selection sort 


The superiority of acommon branching ratio is demonstrated as follows. 
The number of items scanned per item selected (ignoring initial fill and 
termination) is clearly the sum of the branching ratios, whereas the maxi- 
mum number of items accommodated in the top level 1s equal to their 
product. It is therefore easily shown (e.g., by induction or by the use of 
Lagrange multipliers*) that for a fixed number ofitems, optimum execution 
is furnished by a common branching ratio. 


Sorting with replacement. 


Certain of the internal sorting processes 


discussed (bubble and repeated selection) proceed in a succession of 
stages, each of which results in the selection of the smallest remaining 


* See, for example, Margenau and Murphy (1943), p. 205. 
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item. Since this item can be transferred immediately to an output area or 
serial output file, the evacuated position can be refilled by an item from a 
serial input file. Each output item can therefore be replaced by a new item 
from a serial input, and the resulting process is called sorting with replace- 
ment. 

If the key of the newly introduced item exceeds or equals the key of the 
last output item of a group, the new item may be treated as a member of 
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Program 6.33 Repeated selection sort of Program 6.32 executed on the right 
list node vector r of a uniform rooted b-way tree 
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Program 6.34 Repeated selection sort with replacement 
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the original group. If not, the item must be allowed to retain its position 
but must be excluded from consideration in the sorting process. In general, 
sorting with replacement allows the production of longer output strings 
than would be otherwise possible with the same internal storage capacity. 
The expected increase is discussed in Sec. 6.5. Since the process terminates 
with the original positions completely occupied by ineligible items, the 
production of a further string can be begun simply by declaring all items 
eligible. 


Repeated selection sort with replacement. In the repeated selection sort it is 
advantageous to apply the eligibility criterion at each level, i.e., to limit 
selection to keys which equal or exceed the key of the item being replaced. 
The item being replaced is, of course, the last one transferred (either to the 
output file or to the preceding level). The top level items are replaced from 
the input file or, when the file becomes exhausted, by dummy items. 
However, the use of the ‘infinite’ dummy key value e as in Program 6.32 
raises serious difficulties, which are avoided by the use of the value 
—(e + 1). This is done in Program 6.34. 


Program 6.34. This program is very similar to Program 6.32, and only the 
essential differences will be remarked. The main scan loop (15-20) differs only in 
the added comparison with w to prevent the selection of ineligible items. The 
variable w is normally specified (step 13) as the key of the item just transferred 
out of the position being filled. However, if all items are ineligible, then z 
remains unchanged from its initial value established by step 14, and the branch 
from step 21 to 22 occurs. The variable w is then set to —e to make eligible all 
items except the dummy fills [with key value (—(e + 1))], which enter on exhaus- 
tion of the input file. If only these dummies remain in the level scanned, the 
process returns again to step 22, This time, however, the branch to step 3 occurs. 

Step 3 is the file read (which replaces step | of Program 6.32). When the file 
becomes exhausted, the branch to step 2 replaces program statement P3 by p,, so 
that step 3 thereafter provides the requisite dummy keys. 


Bubble sort with replacement. A straightforward bubble sort with re- 
placement produces the same length strings as a first-order selection sort 
with replacement, and, indeed, differs from it mainly in the additional 
performance of item interchanges. Bubble sorting with replacement is 
therefore of interest only if the order induced in the remaining items by the 
interchanges can be used to reduce the number of items scanned. This can 
be achieved by accumulating the ineligible items in a growing suffix of the 
vector of items and restricting successive scans to the prefix of eligible 
items. This method is shown in Program 6.35. 


Program 6.35. In the loop 11-13, w denotes the current item with the smallest 
key, and it is interchanged with x, if k(w) exceeds A(x,). At step 14, w is therefore 
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Program 6.35 Bubble sort with replacement 


the item with the smallest key among those scanned and (unless &(w) = e) is then 
recorded in the output file and used to define z, the key of the last recorded item. 
The main loop is then repeated from step 5, where w is respecified by reading 
an item from the input file. If the new item is eligible, the branch from step 6 to 
step 10 is followed; if not, the parameter j (denoting the index of the first in- 
eligible item %;) is decremented and w is interchanged with the last eligible item x; 
before continuing to step 10. Since step 10 initializes / to the value j, the sub- 
sequent scan is limited to eligible items. 

If 7 becomes zero on step 7, the entire set of items including w is ineligible. 
Step 8 then restores j so as to make all items eligible and hence to start a new 
string. 

The branch from step 5 and the subsequent redefinition of step 5 on step 2 
serve to introduce “‘infinite’’ keys after the input file is exhausted. The immediate 
redefinition of j and z (steps 3 and 4) to make al) items eligible may appear 
redundant. It may be verified that this procedure (1) does not increase the number 
of strings produced, and (2) avoids the potential error of omitting the last string. 
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Address table sort. A sorting process can, in principle, be accomplished in 
two distinct phases, a determination of the permutation required and the 
execution of the permutation. In serial sorting, however, the process 
cannot be effectively divided in this way. For, because of the limitation 
to rank-preserving operations, the items could not be transferred directly 
to their final positions even if they were known. In internal sorting this 
limitation does not apply. A sorting process embodying these two separate 
phases is called an address table sort. The method offers advantages when 
the time required to transfer a complete item is large compared to the time 
required to transfer its key. 

The permutation p must be so determined that the permuted vector 
y = %, is ordered on the key. Let K' be the vector of keys associated with 
the given vector «, i.e., K,' = k(x,), and let K? be the identity permutation 
t'. For example, if the successive keys of the vector * are 17, 9, 6, 11, 4, 8, 


and 3, then 
4 
K=(" 9 6 II 8 ‘} 
123 45 67 


If the columns of K are reordered (by any desired sorting process), to 
produce the matrix P such that P,) is monotone increasing in é, then 
p = P? is the desired permutation vector. In terms of the foregoing 


example, 
3.4 6 8 9 It 17 
p= : 
75362 4 #1 
and p = (7, 5, 3, 6, 2, 4, 1). 


6.5 EVALUATION OF INTERNAL SORTING 
METHODS 


In evaluating internal sorting methods, both the execution time and the 
internal storage requirements must be considered. For the execution 
times of the internal merge sort and the internal column sort, the analysis 
of the corresponding serial sorting process applies directly. Since their 
storage requirements have also been discussed in Sec. 6.4, the present 
section will be limited to the special internal sorting methods. 

The measures of interest in the evaluation of these methods are four: 


the scan length 

the number of stages required 

the number of transpositions required 
the storage ratio. 


heel ea 
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The number of comparisons executed is frequently used as a measure of 
execution time of a sorting process. However, since the selection of the 
keys to be compared (involving instruction modification and possibly 
extraction of key digits) usually accounts for the largest share of the time 
required in a comparison, the scan /ength (number of items scanned) is a 
more suitable measure. In a process which scans m items serially (as in 
each individual stage of a bubble or a ranking sort), the number of com- 
parisons is (# — 1) and differs but little from m. However, for m small 
(as in the successive selections in a repeated selection sort); the ratio 
m + (m — 1) is significant, and the scan length is the better measure. 
Moreover, in a process which does not use a serial scan, the number of 
comparisons may differ markedly from the scan length. For example, 
each half-stage of the interchange sort requires approximately m/2 
comparisons for a scan of m items. 

The storage ratio is defined as the ratio of the number of item storage 
locations required to the number of items entering the sorting process 
(without replacement). The four measures will be determined for each 
method in turn and then compared. Table 6.37 summarizes the main 
results for an assumed random initial distribution of keys. The entire 
analysis is based on the assumption of distinct keys. 


Expected number of transpositions 


Let y(n) be the total number of transpositions required to order all of 
the n! possible arrangements of n items. Since the (” + 1)th item added 
in the last position may rank first, second,..., or last, requiring vn, 
(n — !1),..., or QO additional transpositions, respectively, for each of the 
n! arrangements of the n items, then 


y(n + 1) =(n+ Dy) + at ¥ k 
k=0 
= (n + Iy(n) + (S)on + 1)!. 
The function 
nl (n? — n) 
4 


satisfies this difference equation as well as the obvious boundary condition 
y(0) = 0, and the expected number of transpositions is therefore given by 


y(n) = 


y(n) — (n= 1) 

n! 4 
The maximum and minimum number of transpositions are (n? — n)/2 and 
zero, respectively. 


§6.5 Evaluation of internal sorting methods 233 


A pair of items whose relative order differs from the final arrangement 
will be called a disordered pair. Thus in the set (2, 4, 3, 1), the disordered 
pairs are (2, 1), (4, 3), (4, 1), and (3, |). To establish order it is clearly 
necessary that each disordered pair be transposed. The sequential 
transposition of disordered pairs of adjacent items is also sufficient to 
produce ordering. Any transposition method which transposes only 
disordered pairs therefore achieves order with a minimum number of trans- 
positions and hence with an expected number of (#7? — »)/4. The ranking 
sort, bubble sort, and interchange sort all fall into this category. The 
sequence in which the transpositions occur will, however, normally vary 
for different methods. Thus the sequence of arrangements realized in 
reordering the set (2,4, 3,1) is (2,4, 1,3), (2,1,4,3), (1, 2.4.3), and 
(1, 2, 3, 4) for the bubble sort, and (2, 3, 4, 1), (2, 3, 1, 4), (2, 1, 3. 4), and 
(1, 2, 3, 4) for the ranking sort. 


Bubble sort 


In the bubble sort, both the number of stages and the scan length depend 
on the initial order. The minimum and maximum number of stages 
required are one and (# — 1), respectively. The minimum and maximum 
scan lengths are # and (7? + n — 2)/2, respectively. 

The expected number of stages required for a random initial order ts 
determined* as follows. If, at any stage, all items are ordered except for 
one which occupies a position below (later in the sequence than) its proper 
position, then one further stage will complete the ordering. On the other 
hand, each item which appears above its terminal position at a given stage 
will be moved down by exactly one position in the next stage. Conse- 
quently, the number of stages required will be determined by d, the 
maximum upward displacement from its final position of any item in 
the original array. More precisely, if « is the given vector of items, y the 
corresponding vector of ordered items, p is a permutation such that 
y =*,, and d =! — p, then d = ((e[d)/d),. 

If the maximum upward displacement is d, then (assuming that all 
keys are distinct and that the final order is consequently unique) the last 
item in the final order (1.e., y,,) can initially occupy any one of the (/ + 1) 
components of the suffix w'~!/*, Similarly, y,,., may initially occupy any 
one of the remaining (¢ + 1) positions of the suffix w‘°?/x, and so on for 
each of the (n — d) items of the suffix w” “/y. The number of possible 
initial arrangements of the items of w"~“/y is therefore (d + 1)". Since 
the d leading items (i.e., «”/y) can occupy any one of the d remaining posi- 
tions without restriction, the number of possible initial arrangements with a 


* The exact expression for the number of stages is due to Friend (1956). 
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n c(n) 

l 0.798 
5 0.853 
10 0.877 
15 0.893 
20 0.904 
30 0.918 
40 0.927 
50 0.934 


Table 6.36 Coefficients for 2(”) = c(n)V an/2 


displacement not exceeding d is clearly d!(d + 1)". The probability 
that the maximum displacement r does not exceed d is therefore given by 


(n-d) yy 
pr(r Sf) dt 
n! 
Similarly, a | 
peda Se 
n! 
and hence 


pr(r = d) = pr(r < d) — pr(r < d — 1) 


= (d + pyor-@ d\— deta = 1)! 
n! ; 


The expected value of r is given by 


n-1 


n-1 
fo ee BUS a eK Oe) 
a= 


a=0 

n-1 n-1 
= 1S d(d + 1)" d!— > dx dir~¢+lq = it). 
n!\@=1 aa 
A change of dummy variable in the second summand (d = ¢ + 1!) brings 
the two summands to similar form and yields the result 

e, =n — 2(n), 
where 2(n) = ah s sash 
nls=1 

It is shown in the appendix to this chapter that 2(n) approaches the value 


V mnj2 for large n, and Table 6.36 gives coefficients c() for the approxi- 
mation 


(n) = c(n)V rn]2. 
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Since one extra stage is needed to determine that the ordering has been 
completed, the expected number of stages is given by 
é, =nt 1 — xn). 


Since the scan length in the jth stage of the bubble sort is known to be 
n—j + 1, a similar analysis can be used to determine the expected total 
scan length e,. The result is 


n—n—2 
= + (n+ I)e(n + 1) — ne(n), 
or approximately, 
iis —n—2 ei ee, 

Ranking sort 

The number of stages in the ranking sort is clearly (2 — [). The ex- 
pected scan length on the Ath stage is determined as follows. The item may 
rank in any one of (A + 1) positions; first, second, ..., last, with equal 
aa There are therefore (k + 1) cases requiring scan lengths of 
2,3, ...,k, (A + 1), (A + 1). The last case requires a scan length of 


(A + 1) eather than (Ak + 2), since the process terminates on comparison 
with the last item regardless of the outcome of the comparison. The total 
scan length for the (A + 1) cases is therefore 


a kK? + 5k + 2 
r} — 1 = ———_ 
(>, 2 
and the expected scan length is consequently 2 + k/2 — I/(A + 1). 
Summing over the (7 — |) stages and denoting the expected total scan 
length by e yields the relation 


n-] n-1 


e=2(n—1)+ k— — 
3 ea 


_w+In— 4 eal 
4 rk 

But 

s Dieaped ay ees 

meu. f 2n 127 
approximately,* where y = 0.5772: ++ is Euler's constant. Hence 

2 

a elope aoe 
2h 12h" 


* See, for example, Cramer (1951) p. 125, or Woods (1926) p. I71. 


236 Sorting §6.5 


correct to two decimal places for alla > 1. The maximum and minimum 
scan lengths are (mn? + n — 2)/2 and 2(n — 1), respectively. 

The ranking sort takes advantage of initial order and the minimum scan 
length of 2(n — 1) is achieved for an initially ordered set of items. Ranking 
by insertion with binary search, as described by Program 6.29, requires 
approximately 2 + [log, 7] comparisons on the /th stage. Hence ifn = 2°, 
the number of comparisons required is given by 


e=(2+ kj) — (2 4+ 27? 4+ 23 +--+ 4 2-4 
= (2 + k)2* — 2* = (k + 12". 


In general, then, 
é = n(logyn + 1). 


For a random distribution, ranking by insertion with binary search 
requires fewer comparisons than any other method, and, in the form 
described by Program 6.29, also takes advantage of initial order. The 
insertion operation requires, of course, a number of rotations of relatively 
lengthy vectors. 


Odd-even transposition sort 


Estimates of the efficiency of the transposition sort may be obtained as 
follows. Each half-stage requires the scanning of (approximately) n items 
in n/2 comparisons, and results in reducing the displacement (either up or 
down) of each item by at most one. The fact that the reduction in the 
displacement may be zero for certain items can be illustrated with the 
initial arrangement (5, 6, 1, 2, 3,4). The number of half-stages must 
therefore equal or exceed the maximum displacement, which, in turn, 
equals or exceeds the maximum upward displacement d used in the analysis 
of the bubble sort. Moreover, one final half-stage is required to determine 
that order has been achieved, and the expected number of half-stages will 
necessarily exceed the corresponding value obtained for the bubble sort, 
namely, n + 1 — z(n). Since the number of items scanned per stage in the 
transposition sort exceeds the corresponding number in the bubble sort, it 
follows that the transposition sort is much less efficient. Moreover, the 
transposition sort does not allow sorting with replacement. Its sole 
advantage resides in the possibility of executing all operations in a given 
half-stage in parallel. 


Repeated selection sort 


The number of items scanned per item selected in a pth-degree selection 
sort is equal to the sum of the branching ratios of the associated tree, and, 
as already demonstrated, a minimum scan length is provided by a tree 
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with a common branching ratio m. Since m? > n, the number of items 
scanned per output item is given approximately by 


5 = mp = m log,, A. 


The resulting expression for the total number of items scanned (am log,,, #1) 
is similar in form to the corresponding result for the m-way merge sort. 
The indicated optimum choice of m is Napier’s number e. 

The optimum integral value of m is three, and its efficiency differs from 
the theoretical optimum by less than 1%. The value m = 2 simplifies the 
required program and increases the expected amount of scan by only about 
5%. This case (m = 2) is referred to as a tournament sort. Larger values of 
m may prove advantageous where the groups of items to be scanned are 
contained in a serial store whose scan time is not significantly reduced by 
reducing m. 

Since p item transfers are required per item selected, a more realistic cost 
function for determining the optimum value of m may be given by the 
function 

c=(m + ajp = (m F a) log,, 4, 


where a is the ratio of the time required for an item transfer to the time 
required for the scan of a single key. As remarked in Sec. 6.2, the optimum 
value of m is obtained as a solution of the equation 


log, m=14+". 
m 


The amount of item storage required for a pth-degree selection sort 1s 
given by 


+ 
| 


e=l+mtmert+ec-t+m= 


m— | 
mn 


m—l 


since n= m?, The storage ratio r(m) is therefore given by r(m) = 
m/[(m — 1), a function which decreases with m, rapidly approaching unity. 
This ratio also represents the increase in execution time entailed in the 
initialization of the lower level positions. If sorting with replacement is 
used, initialization is required for the first string only. 

The ratio r(m) changes significantly for the first few values; thus 


r(2) = 2.00; r(3) = 1.50; (4) = 1.33. 


If the expected scan time s(m) is taken as a second criterion, then (since 
s(2) = s(4) > s(3)), the value m = 2 will be eclipsed by both 3 and 4. 


Item Transfers or 


etek ald Stages Scan Length Transpositions Storage 
orting Metho: Ratio 
Expected Max. Min. Expected Max. Min. Expected Max. Min. 
nH n n ie 
Merge (m-way) log, 5) log,” 1 Palekam 3 nlog,, 2 n nilog,, 3 nilog ,,7 n 2 
Column (q digits base 5) q q q ng ng ng ng ng ng 2* 
2 2 2 
*—n—2 on? = 2 : Be 7 
Bubblet n+i-2m|n-1) 1 | SOP gator tpn ; 7 +e n eS 4 5) t 
4 
n—n—2 Fs nin 
Interchange? n+1—2(n) n 2: 7) +t Det + 1) — a2) n> 2n 7 0 I 
F n+7n 1 ntn—2 n—n ron 
Ranking n—1 wT na 7 1.6 — log, a — on Y 2n —2 7 5} 0 I 
Ranking by insertion (binary search) n—1 n—1 |n-1 n@logyn + 2) nloggn + 2)| 2n — 2 cera HE oe dj 0 rt 
4 
lp-th degree selection? § n n n nmp = nm 108i, 0 nmp nmp np np np au 
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at us" Sst oy nn/2 
Wee 


* With optimum allocation (2 fields). 
t Replacement possible. 


= Parallel operation possible: estimates are lower bounds. 
§ All values exclusive of initial fill. 


Table 6.37 


Characteristics of internal sorting methods (for random distribution of distinct keys) 


8ET 


8unsos 


$'°9§ 
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Sorting with replacement 


The use of replacement with any internal sorting method offers the 
advantage of increasing the expected length of the strings produced. 
Gassner (1958) has shown that for v item positions and random keys, the 
expected Jength of the first string is (e — 1) X m= 1.718”, and that for 
later strings the expected length rapidly approaches 2n. 


Comparison of internal sorting methods 


The results of the preceding sections are summarized in Table 6.37. The 
ranking, bubble, and transposition processes show the most favorable 
storage ratio, exactly one half of the best attainable in the merge and the 
column sort processes. In the case of repeated selection, the storage ratio 
depends on m and ranges from two to a limit of one. 

The execution time is approximately proportional to the function 


f(y =stect, 


where s is the scan length, ¢ the number of item transfers or transpositions, 
and c the ratio of the cost (in time) of one item transfer to the cost of a scan 
of one key. For any given method with variable parameters (such as the 
repeated selection sort), these parameters may be chosen so as to minimize 
J(n). The choice between various methods may then be made (subject to 
storage considerations) so as to minimize f(n). 

Since the ranking, bubble, and odd-even transposition methods share 
the same number of transpositions, the choice between them depends on 
the scan length and auxiliary factors. The odd-even transposition sort is 
inferior to the bubble sort in this regard, and the bubble sort is, in turn, 
inferior (by a factor of two for large n) to the ranking sort for n => 8. The 
bubble sort retains the advantage that sorting with replacement may be 
used, and the transposition method allows parallel operation. 


APPENDIX TO CHAPTER 6 


The following derivation of the limit of the function 


a(n) = z Se 


n! s=1 


arising in the evaluation of the bubble sort was suggested by Robert 
Church. Clearly 


L(n) = ia a(s)ds =< 2(n) < U(n) = [" a(s + 1) ds, 
0 


“0 
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where g(s) = — s”-*. Moreover, 
a! 


U(n) = ig (: + “) x6) ds, 


and since g(s) is monotone increasing with a large positive derivative for 
large values of s, only the upper end of the integral is significant. It can 
therefore be shown that 


lim U(n) = lim L(n). 


Consequently, 
lim L(n) = lim 2(n). 


nD n> oO 


Applying Stirling’s formula, 


2 /Dap st+léj—son-s d n 
: TTS e 'S RY = 1g ly 
lim L(n) == I gee =e"n (nt A) ght eds, 
a -a/2an"?e-™ 0 


Setting ¢ = 1 — s/n yields 


nD 


1 
Fi & 14 Lg woo 
lim L(n) en (ntl) i Were tite Norn di 


n7ws 


0 
: nt19 nt 
=n] (l—?f) e” dt, 
0 
' Ug) lo 
=n emttint a) BeA-t) py 
0 


Expanding log, (1 — ¢) yields 


lim L(n) =n ie eR (nt PAF OB) ae 
n> 20 0 
For n large, only small values of ¢ will be significant, and all terms in the 
exponent may be dropped except —(m + 1/2)°/2. Similarly, the upper 
limit of integration may be extended to infinity. Hence, 


lim L(n) = nf eT Mt NE Ay n(af(2n + 1)! 
0 


n>D 


== (mn/2)%. 
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EXERCISES 


6.1 A given file contains a set of 15 items ,,..., %,5 with associated decimal 
keys 68, 9, 60, 14, 60, 73, 79, 15, 67, 5, 9, 41, 57, 9,41. For each of the serial files 
used, show the contents at the conclusion of each stage of the following processes : 
(a) a two-phase classification and merge using 
(i) four files. 
(il) three files. 
(iii) six files. 
(b) a string classification and merge using four files. 
(c) a single-phase merge using 
(i) four files. 
(ii) six files. 
(d) a single-phase merge without rewind, 
(e) a base ten column sort using eleven files. 
(f) a base ten column sort using twenty files. 
(g) a base ten column sort using four files and the partial pass column sort. 
(h) a column sort using four files and a base three representation of the keys. 


6.2 Modify Program 6.4 so that it will work properly without dummy terminal 
items (i.e., each terminal partition A, is to be associated with a legitimate item). 
6.3 Write a program for the string-doubling merge sort. 
6.4 (a) Write a program for a base 6 column sort which uses backward read to 
eliminate rewinding. 
(b) Program a variant of the two-phase column sort (6 + 1 files) so as to 
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eliminate the copying of the “‘zero items” in each merge phase. Deter- 
mine the relative efficiency of the method. 

(c) Program an analogous variant of the m-way two-phase merge sort so as 
to eliminate the copying of part of the items in the classification phase. 


6.5 Construct the matrix M (cf. Table 6.10) specifying an efficient partial pass 
column sort for the following cases: 
(a) base ten and three files. 
(b) base ten and five files. 
(c) base eight and three files. 
(d) base ten and four files using no rewind, i.e., files are to be read backward. 
(e) base ten and three files using no rewind. 


6.6 Using a set of matrices 1M, °M,...,’M, of the form of Table 6.10, write a 
program to generalize the partial pass sort to the case of a mixed radix key. 


6.7 (a) Reprogram the amphisbaenic sort (Program 6.13) so as to reverse the 
order of the final output. (This is the form used in Nagler, 1959.) 
(b) Program a generalization of the amphisbaenic sort which makes use of 
partial passes within columns of the key. 


6.8 (a) Program a modification of the bubble sort process which on odd- 
numbered stages bubbles the smallest item to the leading end and on 
even-numbered stages bubbles the largest item to thc tail end. 

(b) By examining all 4! cases show that for four items the eapected number 
of stages is slightly less for the method of part (a) than for the un- 
modified bubble sort. 

(c) Program a bubble sort using a string indicator s as in Program 6.24 but 
using backward scan only. 


6.9 (a) Program a modification of Program 6.17 to specify sy = 2 and to auto- 
matically double sq and restart if necessary. 
(b) Compare the efficiency of the program of part (a) with that of the 
straightforward program in which the number s of strings is assumed 
equal to the number of items. 


6.10 (a) Derive the relation log, #7 = 1 + a/m which must be satisfied by an 
optimal value of m in order to minimize the cost function ¢ = (m + a) 
log,, 4 (cf. Sec. 6.2). 
(b) Determine the optimal integral value of m for each of the cases a = 0, 
1, e. 
6.11 For the amphisbaenic sort on a set of 5% items with distinct keys belonging 
to the set 1°(b4), determine 
(a) the number of item transfers required. 
(b) the number of file reversals (change of direction of read and record) 
required. 


6.12 Write a program describing the odd-even transposition sort in terms of 
vector operations so as to show clearly the parallel nature of the process. Treat 
all items with odd indices (plus a dummy item) as a vector and all even items (plus 
a dummy item) as a second vector. 
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6.13 (a) For each of the following sorting methods, determine whether initial 
order of the items is preserved, i.e., whether the relative initial order of 
all item pairs with duplicate keys is preserved. 

(i) simple classification and merge. 
(ii) classification and simple merge. 
(iii) ranking sort. 
(iv) bubble sort. 
(v) odd-even transposition. 
(vi) radix exchange sort. 
(b) Prescribe a simple modification of the key which will ensure the preser- 
vation of initial order. 


6.14 For the sequence of keys given in Exercise 6.1, show the explicit execution 
of the following internal sorting processes: 

(a) bubble sort with replacement (six item positions). 

(b) pth-degree selection sort with replacement, with p = 3 and m = 2. 

(c) ranking by insertion. 

6.15 If the key is represented in a base 6 number system, with each digit repre- 
sented in turn in a ranked binary code, then ordering can be achieved by a base 
two column sort on successive columns of the underlying binary representation. 

(a) Show more generally that ordering can be achieved by a base 2! column 
sort on (the base two value of) successive groups of & binary digits. 

(b) Program the process suggested in part (a), including the determination of 
an optimum value of & for a given number x of available files. Assume an 
original key of g digits, each represented by r binary components. Do not 
neglect the problem of terminal conditions. 

(c) If b = 10,r = 4, and if the successive decimal digits are coded alternately 
in a (ranked) bi-quinary and qui-binary system, the binary digits can be 
grouped in twos and threes so as to allow column sorting with a maximum 
of five output files. Program a suitable process. 


6.16 Program a sequence of rotations of infixes of the vector * which will 
reverse the order of its components. (See the Ranking by insertion program 
(6.29) for the case when the key defines a complete reversal of order.) 
6.17 Assuming that an item transfer takes ¢ times as long as a comparison of 
two keys, determine a criterion for the choice of in an m-way internal revision 
merge for the following cases 

(a) assuming 2» comparisons per item (#7 comparisons for eligibility and 

for minimization). 
(b) assuming that a ranking sort is used on the 7 item positions. 


6.18 (a) Let z bea vector key of dimension three whose successive components 

represent department number, job number, and name, respectively. 

Two lists are to be produced, one ordered by name within department 

within job and the other by department within job within name. Deter- 

mine a mapping vector p such that y = Zp is the vector key of least 
dimension which contains the two required orderings. 

(b) Lets” = Spi be a set of vector keys defining a sct of related orderings. 


244 Sorting 


Determine a vector key of minimum dimension which contains all of 
the related orderings for the case p! = (3, 4, 1, 5), p? = (3, 1, 5, 6), and 
p® = (1,4, 3, 5). 

(c) Analyze the effect of related orderings on the expected execution time 
of a merge sort. (Consider the effects of duplicate keys on expected 
string lengths.) 

6.19 Let D be a 0-origin matrix of the form of Table 6.14 which describes the 
cascade sort, i.e., row D/ describes the distribution of strings at the completion of 
stage j. Using the special matrices of Sec. 1.13, write concise expressions for the 
matrices F and B such that 

(a) Di = BD! 

(b) Di =F EDM 

(c) Show that F and B are inverse. 

(d) Determine the dominant eigenvalue of B when »(B) = 3, and show its 

relation to the power of the cascade sort for four files. (cf. Sec. 6.1 and 
Carter (1962).) 


6.20 Determine the relative efficiencies of serial column sorting and serial merge 
sorting for the following conditions. Internal sorting, with a maximum of 100 
item positions, is to be used as an aid to each of the processes, and the time for the 
internal sorting is assumed fixed. There are 10,000 items with 4-digit decimal 
keys, and each key value is associated with at most four items. The initial arrange- 
ment contains 3500 maximal (increasing) strings. 


6.21 Program an address table sort. 


6.22 (a) The determination of the permutation vector required in the address 
table sort can be considered as a mapping from each item onto its rank 
in the set. Show that for distinct keys this mapping can be performed 
by counting for each item the number of items having a smaller key. 

(b) Program the method of part (a). (This is known as a counting sort.) 


6.23. (a) Program a two-phase internal merge sort. 
(b) Program a two-phase internal column sort. 


6.24 Program an extension of Program 6.33 (pth-degree selection executed on 
the right list node vector €°/]T) to 

(a) sorting with replacement. 

(b) the case of a singular homogeneous tree with dispersion vector w(T). 

(c) cover both cases (a) and (b). 


6.25 Ifthe transfer from a serial file can proceed concurrently with other opera- 
tions, it is frequently advantageous to associate two fields of internal storage 
(called buffers) with each file and to transfer the next group of items to one of the 
fields while executing necessary operations on the items of the other. Buffers may 
be used similarly for output files. 
(a) Program an m-way single-phase merge using two buffers for each of the 2m 
serial files. 
(b) Program a base d single-phase column sort using two buffersfor each of the 
26 serial files. 


Exercises 245 


6.26 If the number of input buffers serving files is reduced to m -+ 1 some 
advantage may still be gained by “predicting”’ the file whose (currently) associ- 
ated buffer will next be exhausted, and initiating a transfer from it to the idle 
buffer. Repeat parts (a) and (b) of Exercise 6.25 for (a + 1) and (6 + 1) input 
buffers, respectively. (See Friend, 1956.) 

6.27 Repeat parts (a) and (b) of Exercise 6.25 with the number of output buffers 
also reduced as in Exercise 6.26. 

6.28 Since a given initial arrangement may be easier to bring to descending than 
to ascending order on the keys, and since a final reversal of order may be easy to 
achieve (by backward read in the case of serial files or by address modification in 
the case of internal storage), it may be advantageous to choose ascending or 
descending order according to some estimate based on the initial arrangement. 
Write a program which first counts the number of ascending strings and then 
executes a ranking sort by insertion to produce either ascending or descending 
order according to which appears to be the easier to achieve. 

6.29 For the first few values of », compute and compare the following alterna- 


tive evaluations of the expected number of stages in a bubble sort 
n gr—sgt 


(a) n + 1 — 2(n), where 2(n) = >. 
sol 


n! 

(b) n + 1 — Van/2. 

(c) c/n!, where c is the total count of all stages required for the n! possible 
initial arrangements of n distinct keys. 
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THE LOGICAL CALCULUS 


The present chapter develops two fundamental areas of symbolic logic: 
canonical forms and the basic procedures of decomposition. Q-origin 
indexing is used throughout. 


7.1 ELEMENTARY IDENTITIES 


Certain elementary identities will first be summarized for reference. 
The first of them (equation 7.1) merely defines a matrix of operators 
employed in equation 7.4. 


®= (\ 2 ) (7.1) 

Ale = V [x (7.2) 

De Morgan’s laws as a ple (7.3) 
a = Xe Y (7.4) 

Z|* = 2| +/ (7.5) 

f[e = (" , ¥)A(# SY) (7.6) 

#(X = (Xf ¥) A(X Fy) (7.7) 
Z=(4; oN (qrir=HaGrqgr(ehr) (7.8) 

V/V Ie 5 9) = (v/s) A (VY) (7.9) 

AIA L(® 7 9) = CA/®) A CAL) (7.10) 


Identities 7.2, 7.3, and 7.5 may be established by induction on the 
dimension of «. Equation 7.4 summarizes the sixteen identities obtained 
by extending equations 7.2 and 7.3 to arrays. For example, if ©,” and 
©,’ are the operators # and A, respectively, then equation 7.4 becomes 
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The foregoing relation may be verified as follows: 


(Xx : Y;" — a(x’ A Y,) 


=/(X’ \ Y,) (by equation 7.3) 
= =/(X! v Y,) (by equation 7.2) 


Equation 7.7 is a direct extension of equation 7.6, which is itself derived 
as follows. Since the operator :£ is associative and commutative, then 


f/18 = CAMY/8)) A (Al(y/4)). 


Moreover, 
[yle = 2 \(+/y/*) = 2 \(+/(*% A ¥)) = Al® A Y), 


the first and second and the third and fourth limbs being related by 
equation 7.5. Consequently, 


Al® = (CAM® Sy) ACAI A y))) 
= (x - 9) #(# fy). 
The following argument establishes equation 7.8. By definition, 
Zi = (4; A €) A (9; A 15) 
= (a, A q,) A (€; A ¥,) 
= (a Aq); A (CA 1);. 
Consequently, Z=(€AqQ)?2(CA fr). 
Equation 7.9 is obtained by noting that if M=«*-: y, then M, = 
x A yje. Then 
(v//M); = v//M; = (v/*) A 9, 
and (v//M) (vier y. 
Finally, Viv//M = (v/*) A (v/y). 


The derivation of equation 7.10 is similar. 


7.22 CANONICAL FORMS 
Intrinsic vector 


Any function defined on a finite domain can be specified by listing each 
possible value of the argument together with the corresponding function 
value. For a logical function of # variables, the m arguments may be 
considered as the components of a logical vector * of dimension n, and the 
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domain of the function is then represented by the (rows of the) matrix 
T(n) of dimension 2” x n defined as follows: 


LTk=k, 


For n = 3, for example, T has the form shown in Table 7.1. 


= 


if) | p=Th« 


Ky A A 
%y A XA 
Ky A x A 
Xo A x A 
xy A % A 
Xo A ¥, A 
Xy A XA Xy 
Xo A A x2 


l# ele el 


wow Ww oe 


2 8 
nm ot 


—eFe Ke COCOCSO 
—eFH COrFKHco 
—-orcr oro 
ooor-or or 


f(s) = (f) 7 


(%p A * A %2) V Bo A A ®) V (Xp A ¥ A % 2) 


ll 


Table 7.1 The disjunctive canonical form 


A logical function f can therefore be specified by its intrinsic vector i(f) 
defined by: 
i(f) = fT. 


Table 7.1 shows i( f) for the function 


S(®) = (% A A %) V (Xp A HA %g) V (% A 1 A %) 
= (%) V %) A Xp. 


Applying the usual notation for operations on variables to operations 
on functions as well (e.g., f denotes the function inverse to f, and f A g 
denotes the conjunction of the functions f and g) permits the expression 
of certain easily derived identities concerning intrinsic vectors: 


i(f) =i(f) 
fv g)=i/f) v i(g). 


More generally, the intrinsic vector of any function of functions is the 
same function of their intrinsic vectors. 

The two trivial functions identically one and identically zero will be 
called, respectively, the wait function and zero function, and will be denoted 
by l and 0. Thus i(1) = e, and i(0) = €. 
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Characteristic vectors 


A vector which represents a function fis called a characteristic vector of 
jf. The intrinsic vector i(f) is but one of several useful characteristic 
vectors. 

The expression y / * denotes the function obtained by first negating 
each x, for which y, = 0 and then taking the conjunction (that is, and 
function) over the resulting vector. Such a function is called a minterm in 
x, and the components of the minterm vector 


p= T 2x 


comprise all possible minterms in * with no repetitions. 
The component p, is a function of « which assumes the value one if and 
only if x = T’. Consequently, for any function /, 


f(*) = V/GP)/P) 
V/G) * Pp) 
= i(f) * p. 
This relation is illustrated by Table 7.1. 
The expression 


II 


f(s) = if) p= if): (Ts) 


is called the disjunctive canonical form of the function f since it displays fas 
a (unique) disjunctive (that is, or) function of minterms, each of which 
involves all variables and conjunction and negation only. The disjunctive 
is one of several canonical forms of interest, each of which assumes the 
form 


f(*) = y(f)23 8(%), 


where the characteristic vector y(f) is a function of f only, and the specific 
vector (x) is a function of the argument ¥ only. Each of the four forms of 
interest is characterized by the particular over-all operator ©, occurring 
in it and is therefore called the disjunctive, conjunctive, exclusive disjunctive, 
or eguivalence canonical form* according to whether ©, is the operator 
V,A, 4, Or =. 

The characteristic vector and the specific vector appropriate to each form 
will be characterized by its over-all operator ©O,. Thus 


(8) = YF, Oy) 53 8(®, Oy). 


* The functionse V y,x A y,(@ 4 y), and (x = y) are, except for the trivial functions, 
the only associative commutative functions of two variables. 


T cn |] WL) Tis una TVS WA) Tix Y=) T\e iP) |v} vay | yh | r= 
0 0 1 1 XA AX 1 1 LALAIL 1 ovovo 0 0 0 0 6 
00 0 0 Ky AX AX 0 1 LALA x 0 OV OV x, 1 1 1 1 0 
Oo 1 1 1 By AX) A Xp 1 0 PA wy Al 1 Vx, V0 0 0 ty) 0 1 
o1 0 0 By AX, A XQ 0 0 LA x, A xy 1 OV V x i 1 1 0 1 
1 0 1 1 Ky AH AB 1 Ky V XV Xe 0 xy A TAL 1 Xy VOVO 0 0 0 0 1 
0 0 SALA %g 0 1 
1 x Ax AT 0 

Ko NB A Ry 


( Gg A ® Ay) VCO) V Gey A my A BQ) V CO) V Cy Ay A) VCO) V CO) V (0) (Disjunctive) 


Ba sate (1) A Oy Voy V8) A CDA Og Vy VF) ACD) A Gig Voy VQ) A Gig VV %Q) A Ory VX, V ®y) (Conjunctive) 
S(%) = Gp Vm) A By = 4 
aie , | (C1) 4 Gg) 4 (CO) # CO) / (0) ¥ (CO) 7 (%q A ¥y) 4 Cg A 4 ¥)) (Exclusive disjunctive) 


| as, at as ee PLLA Se 
LI) = @) = () = 0) = CG) = (1) = (py V 8) = (%p V&V Q)] (Equivalence) 


Table 7.2 Intrinsic, characteristic, and specific vectors 


0Sz 


SNJNIIDI [0180] ay 


7L§ 
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The forms are defined formally by the following expressions. * 


yf, V) i (Fx) (Disjunctive) (7.1 1a) 
by 4) 6 (PL 8) (Conjunctive) (7.1 1b) 
LON Ws #)%(T x) (Exclusive disjunctive) — (7.11c) 
Ly(f, =) ¥ (T i ®) (Equivalence) (7.11d) 


Table 7.2 shows the intrinsic vector of the function f of Table 7.1 
together with corresponding pairs of characteristic and specific vectors. 
These may be substituted in equations 7.11 (a-d) to verify that they do 
represent the function /- 

Since « = T* = T, for some k, equation 7.1la may be written as 

LN =S(TY) = 1G VAT YT). 
Consequently, 


YHAVO(T. T= YG AYP SA) (7.120) 
YS A) A(T. T) =, 4) * SCA) (7.12c) 
LY =) v (To T) = ¥(f =) 7 S(=) (7.12d) 
Each of the matrices S(©) appearing in the right limbs of equations 
7,12(a—d) is a fixed function of T and is called the ©-specific (e.g., dis- 
junctive specific) matrix. Since i(f) is a function of y(f, ©) and S(o), the 
relation between the intrinsic vector and each characteristic vector is 
determined by the corresponding specific matrix. 
Since S(v) = (FT 4 T) =(T *T), it is clear that S(v) =f. Conse- 
quently, y(f, v) ¥ S(v) = y(f v), and therefore, 


py va Ps: T) = yf, V) < S(v) (7.12a) 
| 
| 
| 


yf Vv) = i(f). (7.13) 
Similarly, S(A) =f, and y(f, A) 4 S(A) = y(f, A), and again 
vf, A) = i(f). (7.14) 


An explicit expression for $(4) may be obtained by induction on the 
dimension of the corresponding argument, and, to facilitate this, the 
notation T(m) and S(>, n) will be used for the matrices appropriate to an 
argument x of dimension n. T(m + 1) may be written in partitioned form 
as 

T(n + 1) soe C me | 
e T(n) 


* The expression TS x used here for the specific vector in the disjunctive form is 
equivalent to the expression T A x used earlier. [ts use increases the uniformity of the 
expressions for the canonical forms. 
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where € and € are both column vectors. Hence 


€ T(n) ,| € '¢ 
S(A,n + 1) = foes a Nie Sceenecl ae tae 
e. T(n) T(n)| T(n) 
_ : A S(#,n) EA sem) 
EA S(4,n) EA S(A,n) 
sinceee S€=ece=E, andeve = E. Finally, 
SCA, n) SCA, ) 
SAn+)= on on hae 
E (SCA, n) 


11 
Since S(4, 1) = ; : it is clear that 


(7.15) 


ba itt ee 

01010101 

P1111 00110011 
0101 00010001 
SD) 0. 04a Poe oe OL a a 
lo 001 00000101 
00000011 

00000001 


and so forth.* 

The following useful properties of S() are easily verified for the fore- 
going examples and may be established generally by formal induction. 
The matrix is self-inverse with respect to the operations (%), that is, 


SA) MAHL (7.16) 
Moreover, since every row of the transpose S(#) save the zeroth has an 
even number of ones, a 

(2) |-+/S(4) = €°. 
Hence by equation 7.5, - 

#/S(4) = e°. (7.17) 

Since (by equation 7.4) S(=) = S(#) and since S(+) is of even dimension, 
the same result holds for S(=). 


* This result was first obtained by Muller (1954), who employed the matrix C of 
binomial coefficients and showed that S(-4) = (2E)|)C. Also see Calingaert (1960). 
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Equation 7.12c gives i(f) as a function of y(f, 4) and S(+). This relation 
is more commonly written in transposed form and with i(f) replaced by the 
equivalent y(f, Vv) as follows: 

WAY) = A) Vf 4). (7.18) 
Since S(4) is self-inverse (equation 7.16), premultiplication of equation 
7.18 by S(A) % yields an identical expression for y(f, 4) as a function of 
y(f. Vv), namely 7 

Vf, 4) = SA) YL). (7.19) 

The characteristic vectors of the identity function 1 and of the zero 
function 0 may now be derived. Clearly i(1) = € and i(0) =€. Hence by 
equations 7.13 and 7.14, 

yl, v) = yl, A) =€, 
and 

yO, v) = ¥0, A) =e. 
Moreover, 

y(1, A) = S(A) ve 


= £/S(#). 
Hence, by equation 7.17, 
yU, A) =e. (7.20) 
Similarly, y(0, A) =e. 


The relations between the characteristic vectors of a function f and of its 
inverse f may now be obtained. Since 


i(f) =i(f), (7.21) 
then, by equations 7.13 and 7.14, 
yf Vv) =¥U, V), 
and yf A) = ¥(f A). 
Moreover, 4 A) = S(A) Sif) = SCA) 4 ICY), 
by equations 7.13, 7.19, and 7.21. Hence 
vf 4) = (vf 4) # ©) (7.22) 


by equations 7.7, 7.17, 7.19, and 7.13. Characteristic vectors of a function 
and of its inverse are displayed in Table 7.2. 

The relation between y(f, #) and y(f, =) may now be obtained by 
applying equation 7.22 to equation 7.12c to yield 


if) = i(f) = (YA) 4°) % SCH). 


if) 
yf. #) 
vf =) 
i(f) 
V4, #) 
vf, =) 


i(f) 


i(f) 
SA) AUP) 
SM=)7 UP) 
i(f) 

SA) FIP 
S(=) 7 1) 


vf, ¥) ¥(f, =) i(f) Wf #) 
SAA 4) |S ])FVK =] W/ S(=) 7 #) 
vf, #) (Wf, =) =) SCA) RIS) | OK 4) # ©) 
Wf 4) =) | vf =) S=) i) | 4 * 
SHILA |SA*IGZ=) | S(#) * v4, 4) 
Wf #) #8) | ¥f =) SAFI [YK A 
Wf, 4) Wf =) #€) S(=) Fi) | WU =) #0) 
Q) ¥fv) =xfa) =i) 0] 1 
(2) S(4,n) = (TH) ) TH) iif) aa ie 
S(A,n — 1) | E ¥fiv) | € € 
“(on a Wf A) | € | € 
¥f#) | € e° 
¥fi=) | & | « 


s 1 0 
3) M4, 1) = ( H 


(4) S(=) = S(#) 


Zero and identity functions 


Table 7.3_ Relations among characteristic vectors 


YF =) 


S(#) *¥K =) 
¥f, =) 

(v, =) 4 &) 
S(=) 5 14 =) 
(vf, =) =) 


yf, =) 


et 


PST 


gsnjn2jo02 109180] ay T, 


TLS 
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Application of De Morgan’s law and of the fact that S(=) = S(+4) yields 
if) = (VU, 4) Ae) v S(=). 


Comparison with equation 7.12d shows that 


Yh =) = (Vf #) # €) 
= (y(f, #) = €°). 
The relations among the various characteristic vectors are summarized 
in Table 7.3. 


7.3. DECOMPOSITION 

A logical function f(*) is said to be decomposable on u if it can be written 

in the form = 
f(®) = gh(u/*), u/s), 

where g and / are logical functions. Since f, g, and A# are functions of 
v(w), (1 + +/u), and +/t variables, respectively, then if +/a@ > 1, both 
g and / are functions of fewer variables than f, Decomposition on wu such 
that +/u > | thus permits f to be expressed in terms of simpler functions 
g and fA and provides an important basis for simplification techniques. 

Every function is decomposable on u = €. Moreover, if +/# = 1, then 
u = é' for some /, and 

S(®) = [8 A f@ A €)) v [¥, Af Vv €)). 


Since both f(« A €') and f(* v €’) are expressible as functions of €’/x, then 
the foregoing expression is of the required form with A(u/*) = *,. Con- 
sequently, all functions are trivially decomposable for +/i# =0 or 


+/iz = 1. 
Disjunctive canonical form 


Ashenhurst (1957) determines nontrivial decompositions of f(x) by 
arraying the intrinsic vector i(f) in a 2+/* x 27! matrix F defined as 


follows: Pi = if), 
where k=(2e) | k 

i == (2e) | (a/k) 
and J = Qe) | (u/k). 


If, for example, u = (1, 0, 1, 1, 0), then the index k of each component i,(f) 
appearing in F is given by the matrix* C of Table 7.4. The table also shows 


* Ashenhurst (1957) calls the matrix C a decomposition chart and represents F by 
circling each element of C which corresponds to a nonzero component of i(f). 
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Table 7.4 Decomposition of the function f(x) on u 
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corresponding values of i( f) and F. 


\ 


at 


Let p= a x, 
q=T * (u/s), 
and r= T ’. (u/x) 


be the minterm vectors of x, u/x, and u/x, respectively.* Then the matrix 
P=q;r 


contains the components of p arrayed in the same order as the components 
of i(f) in the matrix F. Consequently, t 


f(s) = if): P= VIV I FAP). 


Decomposability depends on the structure of the matrix F. If each 
column of F is either zero or equal to the vector a, there exists a vector ¢ 
such that 


F=a?c. (7.23) 
Hence, f(*) = vivilM, 
where M = (afc) A (qf 1). (7.24) 
But by equation 7.8, M=(aAq)°(c AP), (7.25) 
and hence by equation 7.9, 
f(*) =V/VIIM = (VIG A OD) (VIE A 4) (7.26) 
=(4 iq) A(c%r). 


Since the first and last terms on the right of equation 7.26 are, respec- 
tively, functions of u/x and of u/x only, the function f(*) is decomposable. 
The required functions are simply 


h=h@i/xs)=a¥q=ai[T >@jx)) | 
; Vv | 
f(*) = g(h, u/x) =h A [ex rv] (7.27) 
r= T / (u/s) 
Since equation 7.27 does not represent the most general possible function 


of A and u/x, it appears that the characteristic matrix F = a; ¢ does not 


* Although denoted by the same symbol, the matrices T are of differing dimensions 
as required by compatibility. 

+ Since i(f) is equal to y(f, V ), it may be substituted for it in the disjunctive canonical 
form. 


258 The logical calculus §7.3 


represent the most general function decomposable on uw. Ashenhurst 
(1957) has shown that the most general type of function decomposable on 
u is represented by a characteristic matrix of the form* 


F=(a,b6)V(arevied)v(€? e), (7.28) 
where b, c, d, and e are mutually disjoint and collectively exhaustive, that 


is, 


b+eo+d+e=e. 


The fourth term of equation 7.28 is identically zero and is included only 
for formal completeness; hence 


F=(a°. 6) V (a: c)V(Ee8 d). (7.23’) 
Equation 7.24 now becomes 
M=((4,b)V(aneyvierd) AQ ir), (7.24") 


and since conjunction is distributive over disjunction, equation 7.8 may 
again be applied to yield 
M=(@Aq)° (BAT) VAAN g ie An) VEN Q) 2 (dA 2), 
(7.25’) 
Moreover, since v/V//(X v Y) = (v/v//X) v (v/v //¥), equation 7.9 
may again be applied to yield 
S(*) =v] Vv j/M 
SGA QAM VA DAC MN VESMAG AT). (7.26) 
Since q is a specific vector of the disjunctive canonical form (i.e., a 


minterm vector), it Is some column of the specific matrix S(v). Since 
S(v) =I, q therefore contains exactly one nonzero component, and 


consequently (4 Yq) = (a ; q), ande ¥ qg = |. Equation 7.26 can thus 
be rewritten in decomposed form as 


h = h(ujx) =a iq =a; (T % (4u/s)) } 
f(#) = gh, uls) =(h A (Bir) Vv (HA (eX) V (dX 8) C2r) 
r= T . (u/x) ] 


It is interesting to note that no use has been made of the fact that b, ¢, 
and d are disjoint. Relaxation of this restriction does not, however, 
increase the generality of the matrix F, since a vy € = €, a Vv € = a, and 
av a=e. It does suggest, however, that the matrix F may be expressed 
more compactly as 


a 


F=(a2m) v (a? n), 


* In Ashenhurst’s terminology, the matrix F must be of the following form: each 
column is either all zeros, all ones, the vector a, or the vector a. 
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where m = b v d and n =c vy d. The second line of equation 7.27’ 
then becomes 
f(x) =(h A (m= 4) Vv (A (nm © 1)), (7.28) 


a reflection of the obvious fact that the third term (d « r) of equation 7.27’ 
can be incorporated in the preceding terms. 

Table 7.4 shows a complete example of the decomposition process for 
u = (1, 0,1, 1,0). The characteristic matrix F is obtained by applying 
the matrix C to i(f). Clearly 

F=(a°b)V(aic)V (exc) 
=(a°m)V(a" ny). 
Consequently, 
f(s) = (h A (bn) v (AA (E31) Vv (d 20), 


where h=a:q, q=T (u/x), andr=T > (u/x). 


Since Gd = (¥, A By, ¥y A %y, 4%) A Fy, ¥, A ¥4), 
and a = (1,0, 0, 1), 

then h = (%, A %4) V (%, A %4) = (%, = 4). 
Similarly, bir = (Ky A A %) V (Ky A & A ¥,) 


= %, /\ (%) F ¥3), 
Cir = (Ky A %) V (% A %y A 3), 
and dj} t= % A By A By. 
Alternatively, the use of the vectors m and 7 yields the solution 
S (8) = {AA [By A % A 5) V (%q A Hs) V fA [By A %) 
V (% A (% = 43))]}. 


The entire decomposition process is described by Program 7.5. Steps 
1-7 determine the characteristic matrix F appropriate to the decomposition 
u. The loop 2-7 is repeated for each value of A from zero to 2"? — I. 
Step 2 determines k as the vector (of dimension »(u)) whose base two 
value is k. Steps 3 and 4 then specify the indices / and / appropriate to k, 
and step 5 specifies element F,’. 

Step I] determines d as the vector which specifies all full column vectors 
of F, that is, d; = | if and only if F; =. Step 12 determines e as the 
corresponding vector specifying the zero columns. 

If d and e together exhaust the columns (that is, d vy e = €), then b and 
c (and a arbitrarily) must be set to zero. Since this is done by steps 8-10, 
the exit branch on equality at step 13 terminates the process correctly. If 
(d v e) #e, then any column of the matrix (d v e)/ F can be used to 
specify a; step 14 uses the first column. Step 15 determines 6 as the vector 
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a , 
1—+>k<-0 -origin indexing 
—» (2 OR < 
? (Zee) ‘ f | Logical function of »(t2) variables. 
3 i — (Qe) (uk) u | Logical vector specifying the de- 
4 ; < (2e) (a/R) composition 
] a f(x) = g(A(u/s), u/s). 
5 Be bP) F | Characteristic matrix for the de- 
6 k<-k+1 composition tu: 
= otfu- = 21 /u 
aS] gp. 2 WP) = 277%; (Ry = 251, 
8 i | Row index for F. 
8 a e(+/u) 7 | Column index for F. 
9 b<—e(+/u) k | Scan index for i( f). 
10 c + &(+/u) k | Base 2 representation of &. 
11 d«-e*F a | Nontrivial column of F. 
- b | (b/F),; =a. 
<e/ F 
12 e ef c (c/F); =a, 
13 e: (dv e) [=_, d | (d/F); =. 
14 a~<((d V e)/F), e | (e/F); =€. 
15 b<~GAF 
Legend 
16 c<ahF 
17 “Je :(vecydy ej >. 


Program 7.5 Decomposition of fon u 


specifying the columns of F which are equal to a, and step 16 determines ¢ 
correspondingly for the vector @. 

The function fis decomposable on wu if and only if b, ¢, d, and e together 
exhaust all columns of F. The left-pointing exit on step 17 therefore indi- 
cates nondecomposability on u. 

The algorithm can be extended to test all possible values of u successively 
and so determine all possible decompositions. Use can be made of the 
obvious fact that the matrix F appropriate to decomposition on @ is the 
transpose of the matrix F appropriate to wu. 


Other canonical forms 


Ashenhurst (1957) remarks that decomposability is an intrinsic property 
of a logical function f and is independent of the form of its representation. 
It can also be shown that the particular algorithm of Program 7.5 is intrinsic 
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in that it applies (in a slightly generalized form) to the characteristic 
vectors of all four canonical forms. 

Since i(f) = y(f, A) = y(f, v), it is clear that the decomposition 
algorithm applies directly to the disjunctive and conjunctive characteristic 
vectors. The case for the remaining forms will be developed for the 
exclusive disjunctive form only. 

Let F be defined as in Program 7.5 but with i(/’) replaced by y(f, 7). 
Moreover, let 


p=Tisx 

q = T \ (u/s) 

r= T ¥ (u/s), 
and P= qT. 


Then, clearly f(*) = A/A//F A P). 

As in the case of the intrinsic vector, the structure of F must be expressed 
in terms of the characteristic vectors of a given function 4, of its inverse h, 
and of the identity and zero functions. Jn the exclusive disjunctive form, 
y(l, 4) = &°, (0, 4) = €, and if y(4, 4) = a, then y(4, 4) = (a  €°). 
The term in € may again be disregarded and the form required of F for 
decomposability may (following equation 7.23’) be written as 


F=[(a4e)? bl Vv (a2c)V (2d), (7.23”) 


where b, c, and d are mutually disjoint. 
The matrix M such that f(*) = 34/+4//M may now (as in the analogous 
case of equation 7.25’) be obtained by applying equation 7.8: 


M=((AZO)AQLOAN)V (AN QS(CAN) 
Vv (eA g(a A 4), (7.25") 
Equation 7.25” will also be written as 
M=UyVvyv W, 
where U, V, and W denote the successive matrices of the right limb. 
Since b, c, and d are disjoint, so also are the matrices U, V, and W. For 
any pair of disjoint matrices X and Y, it is easily shown that 
ALA X VY) = (A/A//X) H (A/AI/¥)). 
Hence 
S(#) = AlAl/M = {{(A/4//U) & A/4//V)] A ALALW)}. 
The application of equation 7.10 to each of the matrices U, V, and W now 
yields 
AM = (MA FOYADABAMACAADA (C4 7)] 
ACE AGA (dn 1). (7.26") 


262 The logical calculus §7.3 


Since a, (a ~ €°), and €° are the characteristic vectors of the functions 
h, h, and 1, respectively, equation 7.26” may be written in the decomposed 
form 


h@u/x) = a ,(T ¢ (u/*)) 
Se) = (HA (BARN AMA (EAM) AAAN)} (7.27) 
r= T ) (u/s) ) 


For the example of Table 7.4, 
yf #) =, 1,1, 1,0, 0, 0, 1, 1, 0, 1, 0, 0, 0, 1,0, 0, 1,0, 1, 1,1, 0, 1, 1, 0, 1,0, f, 0, 1, 0). 


Table 7.6 show its decomposition in the exclusive disjunctive form. 


a (a ¥€°) 
1 100 00 1 0 1 0 
1 tod td tt 1 1 1 1 
lec 1101 td t ft 1 1 1 
000000 0 0 0 0 


d=(0 000000 0) 


Table 7.6 Decomposition in exclusive disjunctive form 


The only change required in Program 7.5 1s in the specification of the 
vectors d and b on steps 11 and 15. These may be replaced by the following 
steps: 

I) d<&4F 
15 b<(afe)" F. 


The program may be made completely general (i.e., for y(f, ©)) by simply 
replacing e° with y(1, ©) in the foregoing steps, and replacing € with 
y(0, ©) in steps 8, 9, 10, and 12. 
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EXERCISES 


7.1 Use De Morgan’s laws eee 7.2 and 7.3) to establish 

(a) the identity U V = UY i y. 

(b) the identity U 7 V = Uy Y. 

(c) equation 7.4. 
7.2 Show that the expressions x = (y # z) and y = (x & 2) are equivalent. 
7.3. For the function f(%9, *,, 2) = (% A (*, # %2)) V (Gq A %y), derive 

(a) the intrinsic vector i(f). 

(b) the four characteristic vectors of f. 

(c) the intrinsic vector of f. 

(d) the four characteristic vectors of f. 
7.4 Use the matrix © of equation 7.1 to summarize the canonical form expres- 
sions (equations 7.11 (a—d)) in a single equation. 
7.5 Show that y(f, A) = ¥(f, A). 
7.6 Use De Morgan’s laws and the result of Exercise 7.5 to derive equation 
7.11b from 7.1 1a. 
7.7 Show that ify A 2 = 0, then 

X4(y v2) =(X 7 y) 4 (X72). 

7.8 Let fand ¢ be two disjoint functions (i.e., f(*) A g(*) = 0 for all »), and 
leth = f Vv g. Derive expressions for the four characteristic vectors of A in terms 
of the four characteristic vectors of fand ¢ 


7.9 Each of the sixteen logical functions of two variables may be characterized 
by its intrinsic vector i(f}) and be denoted by 


f@,y) = B, iCf), y). 


For example, (x 4 y) = A(x, (0, 1, 1,0), ¥). (The function A(z, i( f), y) is some- 
times called the kth Boolean function and denoted by f,, (x, y), where k = Li(f).) 
This notation can be extended to vectors x and y so as to permit different func- 
tions to be specified for each component. Thus 


& — B(x, Fy) <=> 2; = B(x, F,, ¥,), 


where »(F) = v(x) and n(F) = 4. Show that 
(a) if x = (0,0, 0, 0,0, 0,1, 1), » = (0, 1, 0, 1, 0, 1, 0, 1), 
and IL F = (0, 0, 0,0, 6, 6, 6, 6), then f(x, F, y) 
= (0, 0, 0, 0, 0, 1, 1, 0). 
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(b) B(x, F, y) = B(*, F,¥) 

(c) B(x, F Vv Gy) = B(x, F,y) Vv B(x, G, ¥) 

(d) B(x, FA G,y¥) = B(x, F,y) A B(x, G,¥). 
7.10 The functions f(x, F, y) defined in Exercise 7.9 can alternatively be ex- 
pressed as f(x, f,y), where f all F. Develop relations on f corresponding to 
those of Exercises 7.9 (b—d). 


SUMMARY OF NOTATION 


S.1 CONVENTIONS 


Basic conventions 


(a) l-origin indexing assumed in this summary. 

(b) Controlling variables appear to the left, e.g., u/x, b | y,k + x, and 
ul x. 

(c) Dimension n may be elided (if determined by compatibility) from 
e(n), €*(n), a*(n), w"(n), and U(x). 

(d) The parameter / may be elided from operators |,, 9;, [;, and ¢, and 
from the vector v if / is the index origin in use. 

(ec) The parameter & may be elided from & } w if k = 1. 


Branching conventions 
# 
(a) |x ly es: 


The statement to which the arrow leads is executed next if (r#y) 
= 1; otherwise the listed successor is executed next. An unlabeled 
arrow is always followed. 


(b) xiy, ros 


The statement numbered s, is executed next if (xr;y) = L. The null 
symbol ° occurring as a component of r denotes the relation which 
complements the disjunction of the remaining relations in r, 


(c) —» Program a, b 
Program a branches to its statement 6. The symbol a may be elided 


if the statement occurs in Program a itself. 


Operand conventions used in summary 


Scalar Vector Matrix Tree 
Logical u,v, WwW u,viw U,V, Ww U,V, W 
Integral hi, j,k h, i,j,k HI, J, K H, I, J, K 
Numerical ry, 2 x,y, % X,Y,Z X,Y, Z 
Arbitrary a,b,c a,b, e¢ A,B,C A, B, C 
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S.2 STRUCTURAL PARAMETERS, NULL 


Dimension 
Row dimension 


Column dimension 


Height 
Moment 
Dispersion vector 


Moment vector 


Degree of node i 
Degree 


Leaf count 


Row dimension 
of file 


Column dimension 
of file 


Null character 


v(a) 
(A) 


H(A) 
v(A) 
H(A) 
v(A) 
u(A) 


o(i, A) 
O(A) 
A(A) 
W(@) 


H®) 


Number of components in vectora@ $1.5 
Number of components in each row 
vector A’ 
Number of components in each 
column vector A; 
Length of longest path in A $1.23 
Number of nodes in A 
yv,(A) = number of roots of A; 
v(A) = maximum degree of nodes 
on level j — 1; »(v(A)) = »(A) 
(A) = number of nodes on level / 
of A; r(4(A)) = (A) 
Degree of node i of tree A 
O(A) = max 6(4, A) 
i 
A(A) is the number of leaves in A 
Number of files in each row of a $1.22 


file array 

Number of files in each column of a 

file array 

Null character of a set (e.g., space in §1.3 
the alphabet) or null reduction 

operator 
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S.3 RELATIONS 


Equality a=b aand 6 are identical 81.15 
Membership aeb a = 6, for somei 
Inclusion bDa a;¢b for alk 

acb 
Strict inclusion b>a b>aanda pb 

acb 
Similarity b=a b>aandaZDab 
Complementary x The relation which holds if and only 
relations if # does not. Examples of comple- 

mentary pairs: «,¢; >, p; °-, +. 

Combined (ored) A list of relations between two vari- 
relations ables is construed as the or of the 


relations. Thus x C > y is equivalent 
to x + y. When equality occurs as 
one of the ored relations, it is indi- 
cated by a single inferior line, e.g., 
<andc. 


S.4 ELEMENTARY OPERATIONS 


Negation w<u w=I[<>u=0 $1.4 
And we—uAv w= l<>u=landr=1 
Or weoeuVer wel<+u=lorv=l 


Relational state- w<-(a4b) w = 1<> the relation a-# b holds 
ment 


Sum z<-v+y zis the algebraic sum of x and y 
Difference z<«—y zis the algebraic difference of « and y 
Product z<-x xy zis the algebraic product of numbers 
z< ay cand y, and c is the arbitrary character 
c«-a@ Xu aor zero according to whether the 
Cc «au logical variable w is one or zero. 
Quotient z«-x > y zis the quotient of.v and y 
Absolute value z<|e| z= x [(w#> 0) -—(x < 0)] 
Floor k <{[x] kak +1 
Ceiling k [x] kon ek -1 


j-Residuemodh k<-hlji i =hgtks jf ok <j +h; andgqis 
integral. 
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8.5 VECTOR OPERATIONS 


Component-by- 
component exten- 
sion of basic 
operation 

Scalar multiple 


Reduction 


Ranking 
/-origin 6-index 
ofa 


j-origin b-index 
ofa 

Left rotation 

Right rotation 

Base ¥ value of x 


Compression 


Expansion 
Mask 
Mesh 
Catenation 


cH#aob 


Bw xLXKXYV 
B+ ry 
ce~a xu 
c~-au 


< Ofa 


c+bija 


c«+bua 


c«kfa 
c<kla 


z—y lex 


c~«~u/b 


c<~uib 
c <-/a,u, b/ 
c <-\a,u, b\ 


e<acb 


c; =a; O 6;. Examples: x x y, $1.5 
(a/b), Al, iu v, [x]. 


ZB =u xXy,andec, =a xu; 


c=C("° ((a, Oo ay) © as) 7) O4,), 81.8 
where © isa binary operator or rela- 
tion with a suitable domain. Examples: 
+/*, x/*, #/u. Reduction of the 
null vector €(Q) is defined as the identity 
element of the operator ©. Examples: 
+/e(0) =0; x/e(0) = 1, v/e(0) = 0, 
A /e(O) = 1. 

$1.16 
c =olfagb; otherwise c is the j- 
origin index of the first occurrence of 
ain b, 
c¢; = bia; 


c; =a,, where j = v(a@) |, G + 4) $1.6 
c; = a,, where j = (a) |, G@ — k) 
z=4/(p x *),wherep, =I!,and 981.14 
Dy. = Dy Xx Y; 


cis obtained from a by suppressing 81.9 
each 6, for which u,; = 0 


u/c =0,ujc =b 

ule =uja,uje =u/b 

ule =a,ujc =b 

C = (44, 4g, ... Aya). Oy, - . - By) = 
\a, o'), b\ 
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S.5 VECTOR OPERATIONS (continued) 


Characteristic of 
xony 


jth unit vector 
Full vector 
Zero vector 


Prefix of weight / 
Suffix of weight j 


Maximum prefix 


Maximum suffix 


Forward set 
selector 


Backward set 
selector 


Maxima selector 


Minima selector 


Interval or 
j-origin 
identity permu- 
tation vector 
j-origin permu- 
tation vector 


j-origin mapping 


j-origin ordering 


& 
w< ey, 


w <—e(h) 
w <— e(/) 
w —e(h) 
w<—0 

w <al(h) 


w <—w'(A) 


w<- xf 


w<—ow/u 


wala 
w <-/a 
w~—ufx 
w<«ulx 


k <—v(A) 


k 


Cc + Ay 
c~«bjja 


k<—0;[x 


w, =(y;e x): o(w) = ry) $1.15 
w, =(i =/) 81.7 
w, = 1 
w,=0 

Ww) =h 


First & of w; are unity 
where & = min(j, A). 

Last k of w; are unity 
where k = min(j,h). J 


w is the max length prefix in t2. 81.10 
Example: 

a/(1, 1, 0,1,0,1) =(1, 1, 0,0, 0, 0). 

w is the max length suffix in w. 

Example: 

o/(1, 1,0, 1,0, 1) = (0, 0, 0, 0, 0, 1). 

w, = lifa; 4a;forallj <i 


w, = lifa; £a;forallj >i 


w, =u, A (x; = m) where 81.18 
m = max (tt/%); 
j 
w, =u; A (x; = mt) where 
m = min (t4/x); 


J 
kR=(j+l.....j+h—-l 81.7 


k = V(v(k)) 81.17 


c,; = cif b, ¢ V((a)); otherwise 

C; = @, ina j-origin system for a. In 
the first form (that is, c <- a@,), the 
origin cannot be specified directly. 

y = kJ,x is in ascending order and 
original relative ordering is main- 
tained among equal components, that 
is, either y; <y,,, Ory; = ¥;,, and 
R; < Ry. 
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S.6a ROW GENERALIZATIONS OF VECTOR 


OPERATIONS 
Z<-XOY Zi =X; OY; $1.5 
Ba O/X By = O/X $1.8 
C+- C/A C= A, Ag Gites 81.15 
ise Oo) A, 
M-+Bi,A M = B',, Ab $1.16 
C-ktA C' =k, } At $1.6 
CRA Ci =k, | A 
z<«-Y LX z,=Y' _ x 81.14 
CA, C; = Ap, $1.17 
C<«-Bj,A Ci = Bj, At 
K «-0,/X Ki = 0),/X? 
C+AGB Ci =A’. Bi $1.9 
C <«-u/B C= u/B 
c <«- U/B e = UVB... 
“+ UNYBE 
C«-u.B u/C =0,u/C = B 
C-«Ub U/C =0, U/C =6 
C — A, u, By u/C = A,u/C = B 
C — a, U, b, U/C =a, U/C =6 
C ~-/A, u, B/ ujC = u/A,u/C = u/B 
C «- /A, U, B/ U/C = U/A, U/C = U/B 
C «-/a, U, 6] C = /E\a, U, E\b/ 
W <2/U Wi = x/U! Shtg 
W ~o/U W? = w/U 
W <—o/U Wi = o/U! 
W <—;/U Wi = 7/U' 
W <- UX Wi = UTX? 81.18 


W <~U[X Wi = U[Xi 
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S.6b COLUMN GENERALIZATIONS OF VECTOR 


OPERATIONS 

Z<XOY Zi =X; oO Y; 81.5 
z+— O//X &; = O/X; $1.8 
C<@//A CHA GOAE::- §L.15 

+ @) Al 

M<+Bu,A M; = B;u, A; $1.16 
C+khA C; =k; A; $1.6 
C~RYA C; =k; | A; 

e-VYiUX 6 Ye §1.14 
C «A° Ci = A $1.17 
C~-BSf,A C; = B; J, A; 

K < 0,//X K; = 9,/X; 

C+A@OB C; = A; @ B; §1.9 
C «-ul/B C; = u/B; 

c~«U//B c = U,/B, @::: 

+++ @ U,/B, 

C~«~u\\B u//C = 0, u//C = B 

C + U\\b Oj//C =0, U/C =b 

C + \\A, u, B\\ u//C = A,u//C =B 

C + \\a, U, B\\ U//C =a, U/C =b 

C-|/A,u,Bi|  — a//C = ul//A,ul/C = ul/B 

C + |/A, U, B]/ U//C = U//A, U/C = U//B 

C <-//a, U, b// C = /E\\a, U, E\\b/ 
W ~—2//U W, =/U, $1.10 
W ~—w//U W; = 0U; 
W <o//U W; =o/U; 
W «-//U W, = -/U; 
W « Ul[X W, = UX; $1.18 
W ~ UILX W, = ULX, 
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S.7 SPECIAL MATRICES 


Full matrix 


Zero matrix 


Superdiagonal 
Identity 
Upper left 
(triangle) 
Upper right 
Lower left 
Lower right 


W ~—E(p,q) 
WwW — E(p, q) 
wWw<0 

Ws Meg) 
W <—I(p,q) 
W<— Ep. 9) 


W—T(p.49) 
W-—-D (p,q) 
W<—-pq) 


wi 
Wi =0 


WiaGt+k=/) 


W = I(p,q) 
Ww, 
Wia-iba(itj< ™ 


ane m = min (p, 9) 


et1—7 
Wwe 


S.8 TRANSPOSITION 


MW) =p 
v(W) = gq, for 
pandgq 
integers, 
Elision of 

p and gq if 
dimensions 
determined 


by 


J compatibility. 


Diagonal C<-B Ci = ) 
\ 
C<-B 
3 (B)+1—4 i 
Counter diagonal C<-B Cipriani =P Bi 
— 
Horizontal C<-B CHBMINE ge 
' 
Vertical C S Cpa 7 = J 
Vector your i= Xa; 
i 
yore 


S.9 SET OPERATIONS 


Intersection c«bNa c=e,"/b 

Difference c<~bAa c=€,"/b 

Union e<«bUa c=bOQ(aq b) 

Cartesian C-b1® Chet ®) = (By,', Bg, . «+. Onn); 
product 2b" d; =(b/); 1 <k; <d; 


Clearly, r(c) =n, and y(c) = x/d 


§1.13 


§1.12 


81.15 
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S.10 GENERALIZED MATRIX PRODUCT 


C<-A&IB 


C;' = O1(A" ©, B;), where ©, produces a vector §1.UI 
(i.e., is not the operator -L), and ©, is a reduction 
operator (and hence C;! is a scalar). 


C! = (A’ © b), where © is any operator which 
produces a vector of dimension »(b). 


C; = (a © B,), where © is any operator which 
produces a vector of dimension »(a). 


A representation of aofthe form §1.22 
(Pi, qa), Po, ay, peng. y(a)s Py) +h Os 

Pa) Dy eens Pyp))s where PDP) Ms the 
partition at position /, py = Pup) = 4, 
and(a! A w)/p < A. 


C<-AOb 

C~-a O B 

C<a0b Ci = (a; © by). 
8.11 FILES 

File ®,; 

Position file n(®,?) <-h 


Record (from position /) 


Forward 


Backward 


oP; —a, Ar 


Pj) © a, ry, 


Read (from position /) 


Forward 


Backward 


File array 
Full 


Row 
Column 


Compression 
Row 


Column 


a, b<— oP; 


a,b <—,®," 


¢)) 


(pe 
,; 


u/® 
u//P 


Set file to position A. Called rewind 
iff = l,and windifh = v(p). 


Gy, <4, Py .4 <3 Stop at position 
h +1. Zero prescript may be elided 
and A, may be elided. 

G1 ~—@; Py_1—A,5 Stop at 
position — 1. A, may be elided. 


a+«-d,; b<—p,,4; stop at position 

ht +1. Associated branch is controlled 
by P,.1, and 6 may be elided. Zero 
prescript may be elided. 

a<- 4,1; b<—p,_1; stop at position 

h — 1. Associated branch is con- 
trolled by p,_4 and 6 may be elided. 


Array of files ®,', for fe ul(()), 
Je vOM)). 

Row of files ®;*, for je UG()). 
Column of files ®,*, for ie UUd®)). 


Selection as in corresponding opera- 


tions on matrices. 
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S.12 TREES 

Path i c<-Al c, is the i,th root of A; c, isthe ith §1.23 
node of the nodes on level j reach- 
able from node c;_,. 

Node i c «-(A‘),, The final node of path A‘. 

Subtree i C <A; C is the subtree of A rooted in node i. 

Component-by- C-A OB (Ci) = (AD qa © (B), (i). 

component 


Path reduction C< ©O/A — Reduction by operator or relation © 
on nodes in left list order. 

Level reduction C-<« ©//A Reduction by operator or relation © 
on nodes in right list order, 

j-origin b-index B<byA (Bi) = 84; (Aa) 

j-origin mapping C<-6f;A Rooted subtree C, is a single null 
character node if b; ¢ U(u,(A)); other- 
wise C; = A,,, where A is treated ina 

j-origin system. 

Full right list C<-jA ] The rows of the index matrix a?/C are 

matrix | the right (left) justified index vectors 

{ (with null fill to the common 
dimension v(A)) arranged in increasing 

Full left list C<-[A | order; C, and C, are the correspond- 

matrix | ing degree and node vectors of A. 

Right list matrix C <a?/JA| The degree and node vector columns 

Left list matrix C<— an of the full right (left) list. 

Tree compression C <- U/A C is obtained from A by suppressing 
node i if node # of U is zero and re- 
connecting so that for each remain- 
ing pair of nodes, the one lies in the 
subtree rooted at the second if and 
only if it did so in A. 

Path compression C <-u/A C is obtained from A by suppressing 
all nodes on level / if u; = 0, and re- 
connecting as in the compression U/A. 

Level com- C~«—u//A Cis obtained from A by suppressing 

pression rooted subtree A; if u; = 0. 

Level mesh C<~\\A,u,Bi\ u//C =A; u//C =B 

Level mask C<=-//A,u,B// a//C =u//A; u//C = u//B. 

Path catenation C<-A@ B Cis obtained by connecting roots of 
B to leaves of A, allotting successive 
groups of at most [y,(B) = A(A)] 
roots of B to each successive leaf of A. 
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S.12 TREES (continued) 


Full tree 


Zero tree 


Path tree 


Level tree 


Maximization 
Minimization 


Maximum path 
prefix 


Maximum path 
suffix 


Forward path 
set selector 
Backward path 
set selector 
Maximum level 
prefix 
Maximum level 
suffix 

Forward level 
set selector 
Backward level 
set selector 


Ww-E 

W «- E(k) 
W-E 
w<0 

W < E(k) 
Ww «UE 
Ww & “E(k) 
WE 
W ~ ,,E(R) 


W-U[A 
W«-U[A 


W «-a/U 


W<« o/U 


W <-G/A 
W«-7/A 
WwW: «//U 
W <-o@//U 
W <—ol/A 


W <-7//A 


Each node of W is unity and the $1.23 
structure of W is determined by 
compatibility. 

Each node of W is unity; W is homo- 
geneous (i.e., all nodes on any level 
have a common degree) and v((W) = R. 
Each node of W is zero and the 
structure of W is determined by 
compatibility. 

Each node of W is zero; W is homo- 
geneous and v(W) = R. 

u/W =0; u/W =E; structure of W 
determined by compatibility. 

u/W =0; u/W =E; Wis homo- 
geneous and W(W) = k. 

u//W =0,u//W =E:; structure of W 
determined by compatibility. 

u//W =0; u//W =E; Wis homo- 
geneous and v(W) = k. 

W =UA (A = mE), where nis the 
maximum (minimum) over all nodes 
of U/A. 

W is obtained from U by zeroing all 
nodes of every subtree rooted ina 
zero node. 

W is obtained from U by zeroing 
every node which contains a zero 
node in its subtree. 

(W'), 3, = 1 if (A), differs from all 
preceding nodes of path A’. 

(Wh) = Lif (AD, ; differs from all 
other nodes of its subtree. 

6 /W = a/e’/U 


ei/W = w/e/U 
6//W =a/e/A 


@}/W =7/e/A 


Index 


0-origin indexing, 71, 186 Automatic programming, 71 
0-residue, 14 Auxiliary variables, 38 
1-origin indexing, 12, 16 Axis of transposition, 25 


1-residue, 14 
Backward chain, 113 


Abelian group, 69 Base address register, 74 
Accumulator, 78 conversion, 190 
index, 73, 79, 83 value, 27, 38, 146 
Active domain, 148, 153 Batching, 138 
Add, clear and, 76 Betz, B. K., 198 
Additive indexing, 73 Binary operation, 13 
Address, 72 search, 141, 155, 221, 236 
leading, 106 Birkhoff, G., 26, 28, 32 
table sort, 176, 213, 231 Biunique transformation, 144, 146 
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Controlled scan, 135, 141, 147 
Controlling parameter, 8 
Conventions, 7 
Conversion, base, 190 
Convert instructions, 87 
Copy operation, 177, 189 
Counter, instruction, 73 
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identity, 253 
inverse, 37 
key, 176 
magnitude, 11 
mapping, 34 
program, 159, 169 
sign, 11 
trivial, 248 
unit, 248 
zero, 248, 253 


Gap, record, 98 
Garner, H. L., 67 
Gassner, B. J., 239 
Generator, 159 
Gilstad, R. L., 198 
Goldstine, H. H., 1, 178 
Graeffe method, 65 
Graph, directed, 46 
index, 48 
Gray code, 63, 104 
Greek letters, 9 
Grid matrix, 106, 118 
Group, abelian, 69 
rotation, 26 


Hanan, M., 154 

Harary, F., 132 

Hardware designer, 71 

Height of a tree, 47 
Homogeneous tree, 58, 121, 226 
Huffman code, 51 

Huffman, D. A., 123, 129, 132 


Identities, logical, 246 
Identity function, 253 
matrix, 26 


Identity, permutation vector, 33 
Immediate branching, 181 
Implicit allocation, 108 

specification, 38 
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Internal sorting, 176, 204, 205, 212, 231, 
239 
Interpreter, 160 
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chained, 110, 116, 135, 147, 221 null, 25, 28 


linear, 108, 110, 118 
nonlinear, 109 
of matrices, 119 
of trees, 121 
partitioned, 115, 118 
solid, 106 
Residue, 12 
Revision operation, 183 
Rewind, 90 
file, 41, 183 
Richards, R. K., 63 
Right list, 226 
Ring, 101 
Riordan, J., 69 
Roman letters, 10 
Root, 46 
vector, 52 
Rooted scan, 135 
tree, 47 
Ross, 1. C., 132 
Rotation, 33 
group, 26 
left, 8, 14 
Row-chained matrix, 120 
Row, compression, 18, 21 
dimension, 14 
expansion, 21 
list, 22, 108, 119 
mapping, 33 
mask, 21 
mesh, 20 
of files, 43 
operation, 8, 18 
reduction, 17 
vector, 14 
Run signal, 81 


of equivalence class, 147 
rooted, 135 
shortest, 139 
to nearest terminal, 139 
Schay, G., 154 
Search, 133 
binary, 141, 221, 236 
overflow, 149 
single table, 149 
Selection, 8, 17, 32, 38 
set, 30 
vector, 22 
Selector set, 23, 58 
Self-indexing, 40 
Sequence, of execution, 2, 5 
register, 73 
Serial access, 41, 135 
sort, 176, 199, 204 
Set operations, 28 
selector, 23, 58 
Shaw, J. C., 110 
Shift instructions, 85 
operation, 78 
Sign function, 11 
Similar vectors, 29 


Simple classification, 177, 205 
classification and merge, 178, 200, 206 


merge, 177, 191, 205 
Single table search, 149 


Single-phase merge, 179, 183, 206 


vs two-phase, 200 
Singular formula, 164 
tree, 47, 60, 165 
Skeleton program, 160 
Skip, 75, 80 
Solid representation, 106 


Sort, address table, 176, 213, 231 
amphisbaenic, 191, 195, 203 
block, 191, 213 
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permutation, 58 
reduction, 58 
representation, 121 
rooted, 47 
singular, 47, 60, 165 
uniform, 58, 145 
Triangular matrix, 26 
Trivial functions, 248 
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vs single-phase sort, 200 


Uniform tree, 58, 145 

Union, 28 

Unit function, 248 
vector, 15, 30 
storage ratio, 212 

Univac, 106 

Utility program, 160 
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